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Methods of screening based on the EGF receptor crystal structure 

Field of the Invention 

This invention relates to the structure of members of the epidermal 
5 growth factor (EGF) receptor family and to receptor/ligand interactions. In 
particular, it relates to the field of using the EGF receptor family structure to 
select and screen for compounds that inhibit the formation of active receptor 
dimers. 

10 Background of the Invention 

Epidermal growth factor is a small polypeptide growth factor that 
stimulates marked proliferation of epithelial tissues and is a member of a larger 
family of structurally related growth factors such as transforming growth factor 
a (TGFa), amphiregulin, betacellulin, heparin-binding EGF and some viral gene 

15 products. Abnormal EGF family signalling is a characteristic of certain cancers 
(Yarden and Sliwkowski, 2001 , Nature Reviews Mol Cell Biol. 2, 127-37; Soler 
and Carpenter, 1994 In Nicola, N. (ed) "Guidebook to Cytokines and their 
Receptors", Oxford Univ. Press, Oxford, pp194-197; Walker and Burgess, 
1994, In Nicola, N. (ed) "Guidebook to Cytokines and their Receptors", Oxford 

20 Univ. Press, Oxford, pp1 98-201). 

The epidermal growth factor receptor (EGFR) is the cell membrane 
receptor for EGF (Ullrich and Schlessinger, 1990, Cell 61, 203-212). The 
EGFR also binds other ligands that contain amino acid sequences classified as 
the EGF-like motif. Other known ligands of the EGFR are amphiregulin 

25 (Shoyab et al.,1988, Proc Natl Acad Sci USA. 85: 6528-6532. ; Shoyab et al., 
1989, Science. 243: 1074-1076.), heparin-binding epidermal growth factor 
receptor (Higashiyama et al., 1991, Science. 251: 936-939.), betacellulin 
(Sasada et al., 1993, Biochem Biophys Res Commun. 190: 1 173-1 179; Shing 
et al.,1993, Science. 259: 1604-1607.), epiregulin (Toyoda et al., 1995, J Biol 

30 Chem. 270: 7495-7500; Toyoda et al., 1 997, Biochem J. 326: 69-75.) and 

epigen (Strachan et al., 2001, J Biol Chem. 276: 18265-18271). Among these 
ligands, the three-dimensional structures of EGF and TGFa have been 
determined by NMR (Montelione et al., 1986 PNAS 83(22): 8594-8; Campbell 
et al., 1989, Prog. Growth Factor Res. 1, 13-22). Upon binding of the ligand to 

35 the extracellular domain, the EGFR undergoes dimerization, which eventually 
leads to the activation of its cytoplasmic protein tyrosine kinase (Ullrich and 
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Schlessinger, 1990, Cell 61, 203-212). The EGFR is also known as the ErbB-1 
receptor and belongs to the type I family of receptor tyrosine kinases (Ullrich, 
and Schlessinger, 1990, Cell 61 , 203-212). This group also includes the ErbB- 
2, ErbB-3 and ErbB-4 receptors. No high affinity ligand has yet been found for 
ErbB-2 (Olayioye et al., 2000, EMBO J. 19: 3159-3167.). The neuregulins are 
alternatively spliced proteins from one of at least four genes which contain an 
EGF-motif and bind to ErbB-3 and/or ErbB-4 (Olayioye et al., 2000, EMBO J. 
19: 3159-3167). One of the neuregulins known as heregulin-1a or NDF was 
found to fold into an EGF-like fold by NMR (Nagata et al., 1994, EMBO J. 13, 
3517-3523 and Jacobson et al., 1996, Biochemistry 36, 3402-3417). The 
EGFR ligands epiregulin, betacellulin and heparin-binding epidermal growth 
factor receptor also bind to ErbB-4 (Olayioye et al., 2000, EMBO J. 19: 3159- 
3167.) 

The type II family of receptor tyrosine kinases consists of the insulin 
receptor (INSR), the insulin-like growth factor I receptor (IGF-1), and the insulin 
receptor-related receptor (Ullrich and Schlessinger, 1990, Cell 61, 203-212). 
Although the type II receptors consist of four chains (0C2P2), both the 
extracellular portions of the receptors from the two families, as well as the 
tyrosine kinase portions, share significant sequence homology, suggesting a 
common evolutionary origin (Ullrich and Schlessinger, 1990, Cell 61, 203-212, 
and Bajaj et al., 1987, Biochim. Biophys. Acta 916, 220-226). 

The 621 amino acid residues of the extracellular domain of the human 
EGFR (sEGFR) can be subdivided into four domains as follows: L1, S1, L2 and 
S2, where L and S stand for "large" and "small" domains, respectively (Bajaj et 
al., 1987, Biochim. Biophys. Acta 916, 220-226, see Fig. 2). The L1 and L2 
domains are homologous, as are the S1 and S2 domains. 

Ligand-induced dimerization was first reported for the EGF receptor 
(Schlessinger, 1980, Trends Biochem Sci 13, 443-447) and now is widely 
accepted as a general mechanism for the transmission of growth stimulatory 
signals across the cell membrane. Although many biochemical experiments 
have been performed to reveal the molecular mechanism of receptor 
dimerization (Lemmon et al., 1997, EMBO J. 16, 281-294 and Tzabar et al., 
1997, EMBO J. 16, 4938-4950 and Lax et al., 1991, J. Biol. Chem. 266, 13828- 
13833), the molecular mechanism by which monomeric ligands induce 
dimerization is still unknown for members of the EGFR family. Single particle 
averaging of electron microscopic images suggests that the overall shape of 
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the sEGFR is four-lobed and doughnut-like (Lax et al., 1991 , J. Biol. Chem. 
266, 13828-13833). Small angle x-ray scattering also indicates that the sEGFR 
can be approximated by a flattened sphere with long diameters of 1 10 A and a 
short diameter of 20 A (Lemmon et al., 1997, EMBO J. 16, 281-294). The 

5 crystallization of sEGFR in complex with EGF has been published (Gunther et 
al., 1990, J. Biol. Chem. 265, 22082-22085; Degenhardtet al., 1998, Acta 
Crystallogr. D Biol. Crystallogr. 54:999-1001), but the structure has not yet 
been reported, despite a decade of effort by many groups. 

One EGF receptor ligand, TGF-oc has been observed to be overproduced 

10 in keratinocyte cells which are subject to psoriasis (Turbitt et al., 1 990, J. 
Invest. Dermatol. 95(2), 229-232; Higashimyama etal., 1991, J. Dermatol., 
18(2), 117-119; Elder et al, 1990, 94(1), 19-25). The overproduction of at least 
one other EGF receptor ligand, amphiregulin, has also been implicated in 
psoriasis. (Piepkorn, 1996, Am. J. Dermatopath., 18(2), 165-171). Molecules 

15 that inhibit the EGF receptor have been shown to inhibit the proliferation of both 
normal keratinocytes (Dvir et al, 1991, J. Cell Biol., 113(4), 857-865) and 
psoriatic keratinocytes. (Ben-Bassat et al., 1995, Exp. Dermatol., 4(2), 82-88). 
These findings indicate that EGF receptor antagonists may be useful in the 
treatment of psoriasis. 

20 Many cancer cells express constitutively active EGFR (Sandgreen et al., 

1990, Cell, 61:1121-135; Karnes etal., 1992, Gastroenterology, 102:474-485) 
or other EGFR family members (Hynes,1993, Semin. Cancer Biol. 4:19-26). 
Elevated levels of activated EGFR occur in bladder, breast, lung and brain 
tumours (Harris, et al., 1989, In Furth & Greaves (eds) The Molecular 

25 Diagnostics of human cancer. Cold Spring Harbor Lab. Press, CSH, NY, 

pp353-357). Antibodies to EGFR can inhibit ligand activation of EGFR (Sato et 
al., 1983 Mol. Biol. Med. 1 :51 1-529) and the growth of many epithelial cell lines 
(Aboud-Pirak et al., 1988, J. Natl Cancer Inst. 85:1327-1331). Patients 
receiving repeated doses of a humanised chimeric anti-EGFR monoclonal 

30 antibody (Mab) showed signs of disease stabilization. The large doses 
required and the cost of production of humanised Mab is likely to limit the 
application of this type of therapy. These findings indicate that the 
development of EGF receptor antagonists will be attractive anticancer agents. 
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Summary of the Invention 

The present inventors have now obtained three-dimensional structural 
information concerning a complex of human epidermal growth factor receptor 
(EGFR) residues 1-501 with human TGFa. In the complex each ligand only 

5 contacts one receptor and each receptor fragment contacts only one ligand. 
The receptor dimer seen in the crystals is a back-to-back dimer (S1 to S1 ). 
The co-ordinates for the EGF receptor in back-to-back dimer configuration are 
shown in Appendix I and Appendix II. Appendix II is a refined version of the co- 
ordinates presented in Appendix I. 

o The information presented in this application can be used to predict the 

structure of related members of the EGF receptor family and the nature of the 
dimers formed by these receptors. This information can be used to develop 
compounds which interact with members of the EGF receptor family for use in 
therapeutic applications. 

5 Accordingly, in a first aspect the present invention provides a method of 

selecting or designing a compound that interacts with a receptor of the EGF 
receptor family and modulates an activity associated with the receptor, the 
method comprising 

(a) assessing the stereochemical complementarity between the 

0 compound and a topographic region of the receptor, wherein the receptor 
comprises: 

(i) amino acids 1-501 of the EGF receptor positioned at atomic 
coordinates as shown in Appendix I or Appendix II, or structural 
coordinates having a root mean square deviation from the 

5 backbone atoms of said amino acids of not more than 1.5A; 

(ii) one or more subsets of said amino acids related to the coordinates 
shown in Appendix I or Appendix II by whole body translations 
and/or rotations; or 

(iii) amino acids present in the amino acid sequence of a receptor of 
o the EGF receptor family, which form an equivalent three- 
dimensional structure to that of amino acids 1-501 of the EGF 
receptor positioned at atomic coordinates substantially as shown in 
Appendix I or Appendix II, or structural coordinates having a root 
mean square deviation from the backbone atoms of said amino 

is acids of not more than 1 .5A, or one or more subsets thereof, 
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(b) obtaining a compound which possesses stereochemical 
complementarity to a topographic region of the receptor; and 

(c) testing the compound for its ability to modulate an activity 
associated with the receptor. 

5 In a preferred embodiment of the first aspect, the structural coordinates 

have a root mean square deviation from the backbone atoms of said amino 
acids of not more than 1 .OA and more preferably not more than 0.7A. 

In one embodiment of the first aspect, the subset of amino acids is 
selected from the group consisting of the subset of amino acids representing 

10 the L1 domain, the subset of amino acids representing the L2 domain and the 
subset of amino acids representing the S1 domain. 

In another embodiment, the subset of amino acids relates to a semi-rigid 
domain within the EGF receptor, such as a domain based on or about residues 
1-84; 191-237; 238-271; 271-284; 285-305 or 313-501; or an equivalent 

15 domain of another member of the EGF receptor family. 

By "stereochemical complementarity" we mean that the compound or a 
portion thereof makes a sufficient number of energetically favourable contacts 
with the receptor as to have a net reduction of free energy on binding to the 
receptor. 

20 From the information provided in Appendix I and Appendix II it can be 

seen that TGFa interacts with residues 1-501 of EGFR such that residues 3-5, 
22, 24, 26, 27, 29-34, 36, 38-41, 43, 44, 47 and 49 of TGFa interact with 
residues 11-18, 20, 22, 26, 29, 30, 45, 69, 89, 90, 98, 99, 101-103, 125, 127 
and 128 of L1 of EGFR and residues 8, 9, 1 1-15, 17, 18, 38, 39, 42 and 44-50 

25 of TGFa interact with residues 325, 346, 348-350, 353-358, 382, 384, 408, 409, 
41 1, 412, 415, 417, 418, 438, 440, 465 and 467 of L2 of EGFR. 

Two residues or groups of residues are taken to "interact" when the 
solvent accessible surface calculated for one set of residues is reduced if it is 
recalculated in the presence of the other set of residues. The solvent 

30 accessible surface is defined by Lee. B and Richards, F. M. (1971) J. Mol. Biol. 
55:379-400 using a probe radius of 1 .4 A. 

The ligand binding surfaces of EGFR are therefore defined by residues 
11-18, 20, 22, 26, 29, 30, 45, 69, 89, 90, 98, 99, 101-103, 125, 127 and 128 of 
L1 and residues 325, 346, 348-350, 353-358, 382, 384, 408, 409, 41 1 , 412, 

35 41 5, 41 7, 41 8, 438, 440, 465 and 467 of L2. It is believed that corresponding 
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regions of other members of the EGF receptor family will also be involved in the 

binding of their natural ligand. 

Accordingly, in one embodiment of the first aspect the compound is 

selected or designed to interact with a member of the EGF receptor family in a 
5 manner such as to interfere with the binding of natural ligand to:- 

(i) one or more of the residues of EGFR selected from the group 

consisting of 11-18, 20, 22, 26, 29, 30, 45, 69, 89, 90, 98, 99, 101-103, 125, 

127, 128, 325, 346, 348-350, 353-358, 382, 384, 408, 409, 411, 412, 415, 417, 

41 8, 438, 440, 465 and 467 and combinations thereof; or 
10 (ii) the corresponding region of other members of the EGF receptor 

family. 

The compound may interfere with ligand binding to one or more of the 
specified residues in a number of ways. For example the compound may bind 
or interact with the receptor at or near one or more of the specified residues or 
corresponding regions and by steric overlap and/or electrostatic repulsion 
prevent natural ligand binding. Alternatively the compound may bind to the 
receptor so as to interfere allosterically with natural ligand binding. For 
example the compound may bind to the L1. and L2 domains in manner such as 
to decrease the "gap" between the L1 and L2 domains thereby preventing 
access of the ligand to one or more of the specified residues. 

Alternatively the compound may bind to the receptor so as to interfere 
allosterically with natural ligand binding. For example: - 

(i) The compound may bind to the L1 and L2 domains in manner such 
as to decrease the "gap" between the L1 and L2 domains thereby preventing 
access of the ligand to one or more of the specified residues. 

(ii) The compound may bind at-or near the interface between S1 and 
either L1 or L2 domains to thereby perturb the domain associations as shown 
in Appendix I and II for the signalling competent ligand-receptor complex. 

(iii) The compound may bind at a site remote from the ligand-binding 
site but disturb the receptor structure so as to reduce the affinity of ligand 
binding. 

Sites for allosteric interference lie within 5 A of atomic positions listed in 
Appendices III and IV. 

It is presently preferred, however, that the compound binds or interacts 
with the receptor at or near one or more of the specified residues or within the 
corresponding region. 
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Accordingly in one embodiment of the first aspect, the receptor is EGFR 
and topographic region of EGFR to which the compound has stereochemical 
complementarity is the ligand binding surface defined by amino acids 11-18, 
20, 22, 26, 29, 30, 45, 69, 89, 90, 98, 99, 101-103, 125, 127 and 128, and/or 
5 the ligand binding surface defined by amino acids 325, 346, 348-350, 353-358, 
382, 384, 408, 409, 41 1, 412, 415, 417, 418, 438, 440, 465 and 467. 

The phrase "EGF receptor family" includes, but is not limited to, the 
EGF receptor, ErbB2, ErbB3 and ErbB4. In general, EGF receptor family 
molecules show similar domain arrangements and share significant sequence 
10 identity, preferably at least 40% identity. 

The known natural ligands for these receptors are as follows: 
EGFR EGF, TGFa, amphiregulin, betacellulin, epiregulin and 

heparin-binding EGF; 
ErbB3 neuregulins 1 and 2; 

15 ErbB4 neuregulins 1-4, betacellulin, epiregulin and 

heparin-binding EGF; 
ErbB2 ErbB2 alone has not been reported to bind any ligand with 

high affinity but is preferred heterodimerisation partner for 
the other three EGF receptor family members, enhancing 
20 their affinities for their respective ligands and amplifying 

their signals. 

The domain structure of the extracellular regions of the EGFR, ErbB-2, 
ErbB-3 and ErbB-4 are the same. The percentage identities of the sequences 
corresponding to the first 501 residues of the EGFR are 42-47 % except for that 

25 for ErbB-3 and ErbB-4 which is 60 %. Previously, it has been possible to 
construct models of ErbB-2, ErbB-3 and ErbB-4 based on the structure of the 
first three domains of the insulin-like growth factor receptor (Garrett et al., 
(1998) Nature. 394: 395-399.) as has been performed for the EGFR (Jorissen 
et al., (2000) Protein Sci. 9: 310-324.) where the sequence identity is 

30 approximately 25%. At the higher sequence identity between EGFR and the 
other EGFR family members, models can be constructed which are expected to 
have a smaller degree of error (Tramontano A. (1998) Methods. 14: 293-300). 

A sequence alignment between the four EGFR family members is shown 
in Figure 1 . Using the information provided in Appendix I Appendix II and the 

35 sequence alignment models of other members of the EGF receptor family can 
be obtained using the methods described in the reference referred to above. 
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The structure of the TGFa- EGFR complex also allows construction of 
the binding of EGFR family ligands to be modelled. Several interactions 
between TGFa and the sEGFR501 suggest that the observed mode of binding 
is the same for the EGFR family members and their ligands. There are two 
5 mainchain-to-mainchain hydrogen bonds between the EGFR L1 domain and 
TGFa :EGFR Gin 16.N - TGFa Cys 32.0 and Gin 16.0 - TGFa Cys 34.N. The 
sidechain of conserved TGFa residue Arg 42 forms a salt bridge with the 
sidechain of conserved EGFR residue Asp 355. 

The sequence alignment of ligands for EGF receptor family is set out in 
10 Figure 2. 

The approximate ligand binding regions of ErbB-2, ErbB-3 and ErbB-4 
can be deduced using the alignment of their sequences to that of the EGFR 
(Figure 1) and the EGFR sequences listed earlier (residues 11-18, 20, 22, 26, 
29, 30, 45, 69, 89, 90, 98, 99, 101-103, 125, 127, 128, 325, 346, 348-350, 353- 

15 358, 382, 384, 408, 409, 411,412,415, 41 7, 41 8, 438, 440, 465 and 467). For 
ErbB-2 (whose N-terminal sequence is taken to be STQV), these residues are 
9-16, 18, 20, 24, 27, 28, 43, 67, 87, 88, 96, 97, 99-101, 133, 135, 136, 333, 
354, 359-358, 361-366, 390, 392, 416, 417, 419, 420, 423, 425, 426, 446, 448, 
473 and 475. For ErbB-3 (whose N-terminal sequence is taken to be SEVG), 

20 these residues are 14-21 , 23, 25, 29, 32, 33, 48, 72, 92, 93, 101 , 102, 104-106, 
129, 131, 132, 322, 343, 345-347, 350-355, 379, 381, 405, 406, 408, 409, 412, 
414, 415, 436, 438, 464 and 466. For ErbB-4 (whose N-terminal sequence is 
taken to be QPSD), these residues are 13-20, 22, 24, 28, 31, 32, 47, 71, 91, 
92, 100, 101, 103-105, 128, 130, 131, 326, 347, 349-351, 354-359, 383, 385, 

25 409, 410, 411, 412, 415, 417, 418, 439, 441, 466 and 468. (Note that the N- 
termini correspond to the putative start of the mature proteins according to their 
entries in the SWISSPROT database at the time of writing.) There are 
expected to be minor differences in the amino acids of the EGFR family 
member (including EGFR) which make up the ligand binding site depending on 

30 the identity of the ligand and receptor. For example, the EGFR residue Gly 442 
is not listed as part of the binding site for bound TGFa but has been implicated 
in the binding of EGF (Elleman et al., (2001) Biochemistry. 40: 8930-8939.). A 
comparative model of the EGF - EGFR 1-501 complex shows that part of the 
sidechain of EGF residue Arg 45 is close to EGFR Gly 442. (The small size of 

35 the TGFa Ala 46 sidechain prevents this contact in the TGFa - bound complex.) 
Other variations in the definition of the ligand binding site for the modelled 



WO 03/014159 



PCT/AU02/01042 



9 

EGFR family member - ligand complex may arise from the variation in the size 
of the so-called B-loop of some of the EGFR family ligands (Groenen et al., 
(1994) Growth FactorsAV. 235-257.). 

In a preferred embodiment of the first aspect of the present invention, the 

5 method comprises selecting or designing a compound which has portions that 
match residues positioned on the ligand binding surface of EGFR defined by 
amino acids 11-18, 20, 26, 29, 30, 45, 69, 89, 90, 98, 99, 101-103, 125, 127 
and 128, and/or the ligand binding surface of EGFR defined by amino acids 
325, 346, 348-350, 353-358, 382, 384, 408, 409, 411, 412, 415, 417, 418, 438 

10 and 465, or the corresponding regions of other members of the EGF receptor 
family. 

By "match" we mean that the identified portions interact with the surface 
residues, for example, via hydrogen bonding or by enthalpy-reducing Van der 
Waals and Coulomb interactions which promote desolvation of the biologically 
15 active compound with the receptor, in such a way that retention of the 
compound by the receptor is favoured energetically. 

In a further preferred embodiment of the first aspect, the stereochemical 
complementarity between the compound and the receptor is such that the 
compound has a Kd for the receptor site of less than 10" 6 M, more preferably 
20 the Kd value is less than 10" 8 M and more preferably less than 10" 9 M. 

In preferred embodiments of the first aspect of the present invention, the 
compound is selected or modified from a known compound identified from a 
data base. 

A second aspect of the present invention provides a method of selecting 
25 or designing a compound that inhibits the formation of active dimers of 
receptors of the EGF receptor family, the method comprising: 

(a) assessing the stereochemical complementarity between the 
compound and a topographic region of the receptor, wherein the receptor 
comprises: 

30 (i) amino acids 1 -501 of the EGF receptor positioned at atomic 

coordinates as shown in Appendix I or Appendix II, or structural 
coordinates having a root mean square deviation from the 
backbone atoms of said amino acids of not more than 1 .5A; 
(ii) one or more subsets of said amino acids related to the coordinates 

35 shown in Appendix I or Appendix II by whole body translations 

and/or rotations; or 
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amino acids present in the amino acid sequence of a receptor of 
the EGF receptor family, which form an equivalent three- 
dimensional structure to that of amino acids 1-501 of the EGF 
receptor positioned at atomic coordinates substantially as shown in 
5 Appendix I or Appendix II, or structural coordinates having a root 

mean square deviation from the backbone atoms of said amino 
acids of not more than 1 .5A, or one or more subsets thereof, 
(b) obtaining a compound which possesses stereochemical 

complementarity to a topographic region of the receptor; and 
10 (c) testing the compound for its ability to inhibit the formation of active 

dimers of the receptors. 
From the information provided in Appendix I and Appendix II it can also 
be seen that in the EGF dimer residues 38, 86, 194, 195, 204, 205, 230, 239, 
242-246, 248-253, 262-265, 275, 278-280, 282-288 and 318 of the first 
15 receptor of the dimer interact with residues 86, 1 93, 1 94, 204, 205, 229, 230, 
239, 242, 244-246, 248-253, 262-265, 275, 278-280 and 282-287 of the second 
receptor of the dimer. It is believed that corresponding regions of other 
members of the EGF receptor family will also be involved in the formation of 
active dimers. 

20 Accordingly, in a further preferred form the compound is selected or 

designed to interact with a member of the EGF receptor family in a manner 
such as to interfere with the formation of active dimers by inhibiting interaction 
of; 

(i) residues 38, 86, 1 94, 1 95, 204, 205, 230, 239, 242-246, 248-253, 
25 262-265, 275, 278-280, 282-288 and 31 8 of EGFR or the corresponding region 

of a member of the EGF receptor family; 
with 

(ii) residues 86, 193, 194, 204, 205, 229, 230, 239, 242, 244-246, 248- 
253, 262-265, 275, 278-280 and 282-287 of EGFR or the corresponding region 

30 of a member of the EGF receptor family. 

The compound may interfere with dimerization in a number of ways. For 
example the compound may bind to the EGFR at or near one or more of the 
specified residues and by steric overlap an/or electrostatic repulsion prevent 
dimerization. Alternatively the compound may bind to EGFR so as to interfere 

35 allosterically with dimer formation. 
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Accordingly in one preferred embodiment of the second aspect, the 
receptor is EGFR and the topographic region of the EGFR to which the 
compound, or a portion thereof, has stereochemical complementarity is the 
dimer interface defined by amino acids 38, 86, 194, 195, 204, 205, 230, 239, 

5 242-246, 248-253, 262-265, 275, 278-280, 282-288 and 31 8 and/or the dimer 
interface defined by amino acids 86, 193, 194, 204, 205, 229, 230, 239, 242, 
244-246, 248-253, 262-265, 275, 278-280 and 282-287 . 

The regions of ErbB-2, ErbB-3 and ErbB-4 involved in dimerization can 
also be deduced using the alignment of their sequences to that of the EGFR 

10 (Figure 1 ) and the EGFR sequences listed earlier (residues 38, 86, 1 93-1 95, 
204, 205, 229, 230, 239, 242-246, 248-253, 262-265, 275, 278-280, 282-288, 
318). For ErbB-2 (whose N-terminal sequence is taken to be STQV), these 
residues are 36, 84, 201 - 203, 21 1 , 21 2, 236, 237, 246, 249-253, 255-260, 269- 
272, 282, 285-287, 289-295, 326. For ErbB-3 (whose N-terminal sequence is 

15 taken to be SEVG), these residues are 41 , 89, 1 93-1 95, 204, 205, 229, 230, 
239, 242-246, 248-253, 262-265, 275, 278-279, 281-287, 317. For ErbB-4 
(whose N-terminal sequence is taken to be QPSD), these residues are 40, 88, 
195-197, 206, 207, 231, 232, 241, 244-248, 250-255, 264-267, 277, 280-281, 
283-289, 319. (Note that the N-termini correspond to the putative start of the 

20 mature proteins according to their entries in the SWISSPROT database at the 
time of writing.) 

The mode of dimerization seen in the crystal structure is consistent with 
homodimers and heterodimers of all four EGFR family members. Several 
residues which appear to be important for maintaining the dimer interface in 

25 EGFR are conserved in the EGFR family. The conserved Asn 247 makes 

sidechain-to-mainchain hydrogen bonds which help to maintain the structure of 
the loop which interacts with the other EGFR molecule in the dimer. Residues 
Tyr 251 and Phe 263 are involved in packing interactions across the interface; 
these residues are either tyrosine or phenylalanine in ErbB-2, ErbB-3 and 

30 ErbB-4. The side chain of the conserved residue Tyr 246 makes hydrophobic 
packing and hydrogen bonding interactions with the other EGFR in the dimer. 

As used herein the term "dimer" is intended to cover both homodimers 
and heterodimers. 

By "active dimer" we mean a dimeric form which causes signalling. 

35 In a further embodiment of the second aspect of the present invention, 

the method comprises selecting or designing a compound which has portions 
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that match residues positioned on the dimer interface of EGFR defined by 
amino acids 38, 86, 194, 195, 204, 205, 230, 239, 242-246, 248-253, 262-265, 
275, 278-280, 282-288 and 318 or the corresponding regions of other members 
of the EGF receptor family and/or the dimer interface defined by amino acids 
5 86, 193, 194, 204, 205, 229, 230, 239, 242, 244-246, 248-253, 262-265, 275, 
278-280 and 282-287 or the corresponding regions of other members of the 
EGF receptor family. 

In a preferred embodiment the compound is designed or selected to 
comprise a first domain which interacts with the dimer interface of a first EGF 
10 receptor family member and a second domain which interacts with the dimer 
interface of a second EGF receptor family member. As will be recognised such 
a compound will cross-link receptor and prevent formation of active dimers. 

In a further preferred embodiment of the second aspect of the present 
invention, the stereochemical complementarity is such that the compound has 
15 a Kd for the receptor site of less than 1 0^M. More preferably, the Kd value is 
less than 1 0" 8 M and more preferably less than 1 0" 9 M. 

In preferred embodiments of the second aspect of the present invention, 
the compound is selected or modified from a known compound identified from a 
data base. 

20 The information provided in Appendix I and Appendix II also reveals the 

portions of TGFa which are involved in receptor binding. With this information 
TGFa variants may be designed in which specific residues are modified or 
altered such that the variant retains is able to bind to one ligand binding surface 
but not the other. It would be expected that such a variant would compete with 

25 the natural ligand for binding to the receptor but that binding of the variant to 
the receptor would not lead to signalling. Such a variant would therefore be an 
antagonist. In a similar manner variants which would act as agonists could be 
designed. In this case the modifications or alterations would be selected such 
as to increase the strength of interaction between the receptor and the variant 

30 so as to lead to increased signalling. 

In a similar manner to that described for TGFa , variants of other ligands 
of the EGF receptor family may also be designed. 

Accordingly in a third aspect the present invention consists in a TGFa 
variant in which the sequence of TGFa is modified such that the ability to 

35 interact with L1 of EGFR is retained or increased and the ability to interact with 
L2 of EGFR is removed or decreased, or vice versa. 
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In a fourth aspect the present invention consists in a TGFa variant in 
which the sequence of TGFa is modified such that the ability to interact with L1 
of EGFR is retained or increased and the ability to interact with L2 of EGFR is 
retained or increased, with the proviso that the binding to at least one of L1 or 
5 L2 is increased. 

In a preferred embodiment of these aspects of the present invention the 
TGFa variant is modified at one more of the positions selected from the group 
consisting of 3-5, 8, 9, 11-15, 17, 18, 22, 24, 26, 27, 29-34, 36 and 38-50. 

In a fifth aspect the present invention consists in an EGF variant in which 
10 the sequence of EGF is modified such that the ability to interact with L1 of 
EGFR is retained or increased and the ability to interact with L2 of EGFR is 
removed or decreased, or vice versa. 

In a sixth aspect the present invention consists in an EGF variant in 
which the sequence of EGF is modified such that the ability to interact with L1 
15 of EGFR is retained or increased and the ability to interact with L2 of EGFR is 
retained or increased, with the proviso that the binding to at least one of L1 or 
L2 is increased. 

By "variant" we mean that the natural sequence of EGF or TGFa has 
been modified by one or more point mutations, insertions of amino acids, 

20 deletions of amino acids or replacement of amino acids, in particular using non- 
natural amino acids such as D-isomers of natural amino acids, 2,4- 
diaminobutyric acid, a-amino isobutyric acid, 4-aminobutyric acid, 2- 
aminobutyric acid, 6-amino hexanoic acid, 2-amino isobutyric acid, 3-amino 
propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, 

25 citrulline, homocitrulline, cysteic acid, t-butylglycine, t-butylalanine, 

phenylglycine, cyclohexylalanine, p-alanine, fluoro-amino acids, designer 
amino acids such as (3-methyl amino acids, Ca-methyl amino acids, Na-methyl 
amino acids, p-naphthalimo amino acids and amino acid analogues in general. 
The information provided in Appendix I and Appendix II also reveals the 

30 portions of EGFR which are involved in dimer formation and the portions EGFR 
involved in ligand binding. With this information EGFR variants or fragments 
may be designed in which specific residues are modified or altered such that 
the variant or fragment retains the ability to form dimers with the EGFR and or 
bind ligand. It would be expected that such variant or fragments would 

35 compete with the natural receptors for dimerization or ligand binding but that 
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dimerization of the variant or fragment with the receptor would not lead to 
signalling. 

Accordingly in a seventh aspect the present invention consists in a 
polypeptide, the polypeptide comprising amino acids which interact with amino 

5 acids 38, 86, 1 93-1 95, 204, 205, 229, 230, 239, 242-246, 248-253, 262-265, 
275, 278-280, 282-288, 318 of EGFR or the corresponding region of a member 
of the EGF receptor family, or which are involved in binding of natural ligand of 
the EGF receptor family. 

In a preferred embodiment the polypeptide is based on the native 

10 sequence of EGFR but includes modifications such that the interaction between 
the polypeptide and the native receptor is preferred over the interaction 
between native receptors. 

In a further preferred embodiment the polypeptide is based on the native 
sequence of EGFR but includes modifications such that the interaction between 

15 the polypeptide and the natural ligand is preferred over the interaction between 
the natural ligand and native receptor. 

As will be understood by those skilled in this field knowledge of the 
structure of a protein complex is of assistance in the development of mutants of 
one of the proteins with enhanced affinity for its protein partner. Structural 

20 information can be used to select residues on one or more of the protein 
interfaces in the complex for alteration by methods such as site-directed 
mutagenesis or phage display. For example, amino acid positions in growth 
hormone which were allowed to vary were chosen in part from the crystal 
structure of the complex of growth hormone bound to two molecules of the 

25 human growth hormone extracellular region (Lowman and Wells (1 993) J Mol 
Biol. 234: 564-578.). Using a model of the granulocyte colony-stimulating 
factor (G-CSF) receptor ligand binding domain, residues of the receptor were 
chosen for mutagenesis by analogy with the structure of human growth 
hormone bound to its receptors (Layton et al., (1997) J Biol Chem. 272: 29735- 

30 29741 .). Some of the mutant G-CSF receptors were found to bind G-CSF with 
slightly enhanced affinity (Layton et al., (1997) J Biol Chem. 272: 29735- 
29741 .). The structure of the complex could also be used to design mutations 
which would potentially increase the binding affinity, for example by increasing 
the amount of hydrogen bonds and/or van der Waals interactions across the 

35 interface. 
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The modification of protein residues to enhance protein binding affinity is 
not restricted to those residues in the relevant protein-protein interfaces. 
Modification of residues outside of an interface may lead to alterations due to 
changes in the long-range electrostatic interactions between the two interacting 

5 proteins which changes the rate of association and subsequently the 

equilibrium binding constant (Selzer and Schreiber (1999) J Mol Biol. 287: 409- 
41 9.; Selzer et al., (2000) Nat Struct Biol. 7: 537-541). The contribution of 
mutations to the association rate can be calculated and has been used to 
increase the association rate (without greatly changing the dissociation rate) 

10 and the affinity of p-lactamase inhibitory protein to TEM1 p-lactamase by a 
factor of 250 (Selzer et al., (2000) Nat Struct Biol. 7: 537-541 .). 

There are two proposed modes of antagonist action of appropriate 
extracellular fragments of EGFR family members. The first is ligand binding. 
The SEGFR501 binds EGF and TGFa with approximately 10 times higher 

15 affinity than the full length extracellular portion of the EGFR (Elleman et al., 
(2001 ) Biochemistry. 40: 8930-8939.). The second mode is the association of 
these proteins with full-length receptors. Recombinant forms of the EGFR and 
ErbB-2 which contain only the extracellular domain and transmembrane 
domain are able to inhibit EGF-induced signalling when expressed on cells 

20 which also express the full length EGF receptor (Kashles et al., (1 991 ) Mol Cell 
Biol. 1 1 : 1454-1463; Spivak-Kroizman et al., (1992) J Biol Chem. 267: 8056- 
8063; Qian et al., (1999) J Biol Chem. 274: 574-583.), suggesting that the 
recombinant proteins act in a dominant negative manner which involves their 
extracellular regions. 

25 The structure of the EGFR complex can be used to design mutations for 

extracellular fragments of EGFR family. Structural models of the other EGFR 
family members can be constructed as previously described. Mutations can be 
made either by expressing mutant versions of EGFR 1-501 or its homologues 
in which residues have been mutated individually or as groups, or by using the 

30 structure to locate amino acid positions which can be changed using methods 
such as phage display or DNA shuffling. These mutants can be tested or 
selected for enhanced affinity relative to the extracellular fragment based on 
the wild type EGFR family member's amino acid sequence. The preferred 
EGFR amino acids which are candidates for mutation are as follows:- 
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(i) 11-18, 20, 22, 26, 29, 30, 45, 69, 89, 90, 98, 99, 101-103, 125, 127, 
128, 325, 346, 348-350, 353-358, 382, 384, 408, 409, 41 1, 412, 415, 
417, 418, 438, 440, 465 and 467, or 

(ii) 38, 86, 193-195, 204, 205, 229, 230, 239, 242-246, 248-253, 262- 
5 265, 275, 278-280, 282-288, 31 8. 

The relevant residues for other members of the EGF receptor family can 
be determined from sequence alignments. 

Additionally, the mutation of residues which are outside of the relevant 
binding interface may also alter the binding affinity by changes in the long 

10 range electrostatic interactions. These changes can affect the rate of 

association between two interacting proteins without greatly changing the rate 
of dissociation, and hence change the equilibrium binding constant (Selzer and 
Schreiber (1999) J Mol Biol. 287: 409-419.; Selzer et al., (2000) Nat Struct Biol. 
7: 537-541 .). In one example of increasing the affinity of binding by mutating 

15 residues outside of the protein-protein interface, selected residues of the p- 
lactamase inhibitory protein that were outside of the interface were mutated so 
as to change their charge e.g. a basic residue mutated to a neutral residue and 
then the affinity and rate constants of the mutant binding to TEM1 p-lactamase 
was measured. In one mutant, the change of four amino acids led to an 

20 enhancement of binding by a factor of more 250-fold (Selzer et al., (2000) Nat 
Struct Biol. 7: 537-541 .). In this example, the authors specified a formula which 
predicted the changes in the association constant upon mutation to within a 
factor of two (Selzer et al., (2000) Nat Struct Biol. 7: 537-541 .). In this way, the 
structure of the EGFR or a model of one other EGFR family members could be 

25 used to predict mutations that would likely lead to an enhancement of the rate 
of association of the relevant EGFR family extracellular fragment to its 
interacting protein. Calculation and subsequent visualization of the 
electrostatic isopotentials (e.g. Smith and Treutlein (1998) Protein Sci. 7: 886- 
896.) may assist the selection of residues to mutate in order to increase the 

30 protein's rate of association. The most likely candidate residues for mutation 
are those on the periphery of the interface and those outside of the interface 
but which are within a specified distance of the interacting protein and are not 
completely buried in the L1 or L2 domain (as judged by visual examination). 
Cysteine residues, which are needed for the maintenance of the EGFR 

35 structure were also excluded from the list. For the EGFR, the preferred 
residues are: 
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(i) 5, 6, 8-10, 19, 21-25, 28, 32, 33, 38, 39, 40, 42, 44, 47, 48, 50, 63, 
64, 66, 68, 71, 73, 87, 88, 91-94, 96, 104-107, 109, 123, 130, 131, 151- 
160, 315-324, 326, 328, 329, 331, 332, 343, 344, 351, 359-363, 379, 
380, 385, 387, 388, 394, 404-407, 410, 413, 420, 434-436, 440, 441, 

5 443, 448, 449, 461 -464, 466-468; or 

(ii) 1-6, 8,9, 11, 30, 35, 36, 39, 40, 60, 62-64, 82, 84, 85, 87-89, 94, 118, 
120-122, 148, 187-193, 196-198, 200-203, 209-211, 213, 215, 217-221, 
231-233, 235, 237, 238, 241, 243, 244, 247, 254-261, 266, 268-270, 
272-274, 276, 277, 281, 289-297, 299-301, 303, 304, 311, 312, 314-317, 

10 31 9-323, 335, 340, 342-344, 346, 376, 378-380, 403-41 2, 434, 459. 

The relevant residues for other members of the EGF receptor family can 
be determined from sequence alignments. 

In an eighth aspect the present invention provides computer-assisted 
method for identifying potential compounds able to interact with a member of 
15 the EGF receptor family and thereby modulate an activity mediated by receptor, 
using a programmed computer comprising a processor, an input device, and an 
output device, comprising the steps of. 

(a) inputting into the programmed computer, through the input device, 
data comprising the atomic coordinates of amino acids 1-501 of the 

20 EGF receptor molecule as shown in Appendix I, or structural 

coordinates having a root mean square deviation from the 
backbone atoms of said amino acids of not more than 1 .5A, or one 
or more subsets of said amino acids, or one or more subsets of 
said amino acids related to the coordinates shown in Appendix I by 

25 whole body translations and/or rotations; 

(b) generating, using computer methods, a set of atomic coordinates of 
a structure that possesses stereochemical complementarity to the 
atomic coordinates of amino acids 1-501 of the EGF receptor 
molecule as shown in Appendix I, or structural coordinates having a 

30 root mean square deviation from the backbone atoms of said amino 

acids of not more than 1.5A, or one or more subsets of said amino 
acids, or one or more subsets of said amino acids related to the 
coordinates shown in Appendix I by whole body translations and/or 
rotations, thereby generating a criteria data set; 

35 (c) comparing, using the processor, the criteria data set to a computer 

database of chemical structures; 



f 
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(d) selecting from the database, using computer methods, chemical 
structures which are similar to a portion of said criteria data set; and 

(e) outputting, to the output device, the selected chemical structures 
which are complementary to or similar to a portion of the criteria 

5 data set. 

In a preferred embodiment of the eighth aspect the subset of amino 
acids are the amino acids (i) defining either or both the ligand binding 
surface(s), or (ii) defining dimerization interface. 

In a further preferred embodiment the method is used to identify potential 
10 compounds which have the ability to decrease an activity mediated by the 
receptor. 

In a further preferred embodiment of the eighth aspect, the method 
further comprises the step of selecting one or more chemical structures from 
step (e) which interact with a member of the EGF receptor family in a manner 

15 such as to interfere with the binding of natural ligand to:- 

(i) one or more of the residues of EGFR selected from the group 
consisting of 11-18, 20, 22, 26, 29, 30, 45, 69, 89, 90, 98, 99, 101-103, 
125, 127, 128, 325, 346, 348-350, 353-358, 382,384, 408, 409, 411, 
412, 415, 417, 418, 438, 440, 465 and 467 and combinations thereof; or 

20 (ii) the corresponding region of other members of the EGF receptor 

family. 

In a further preferred embodiment of the eighth aspect, the method 
further comprises the step of selecting one or more chemical structures from 
step (e) which interact with one or more of the residues of EGFR selected from 

25 the group consisting of amino acids 38, 86, 1 93-1 95, 204, 205, 229, 230, 239, 
242-246, 248-253, 262-265, 275, 278-280, 282-288, 318 or the corresponding 
region of other members of the EGF receptor family. 

In a further preferred embodiment of the eighth aspect, the method 
further comprises the step of obtaining a compound with a chemical structure 

30 selected in steps (d) and (e), and testing the compound for the ability to 
decrease an activity mediated by the receptor. 

The present invention also provides a method of screening of a putative 
compound having the ability to modulate the activity of a molecule of the EGF 
receptor family, comprising the steps of identifying a putative compound by a 

35 method according to the first or third aspects, and testing the compound for the 
ability to increase or decrease an activity mediated by the molecule. In one 
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embodiment, the test is carried out in vitro. Preferably, the in vitro test is a high 
throughput assay. In another embodiment, the test is carried out in vivo. 

In a ninth aspect the present invention provides a computer for 
producing a three-dimensional representation of a molecule or molecular 
5 complex, wherein the computer comprises: 

(a) a machine-readable data storage medium comprising a data 

storage material encoded with machine-readable data, wherein the 
machine readable data comprise the atomic coordinates of amino 
acids 1-501 of the EGF receptor molecule as shown in Appendix I, 
10 or structural coordinates having a root mean square deviation from 

the backbone atoms of said amino acids of not more than 1 .5A, or 
one or more subsets of said amino acids, or one or more subsets of 
said amino acids related to the coordinates shown in Appendix I by 
whole body translations and/or rotations; 
15 (b) a working memory for storing instructions for processing the 

machine-readable data; 

(c) a central-processing unit coupled to the working memory and to the 
machine-readable data storage medium, for processing the 
machine-readable data into the three dimensional representation; 

20 and 

(d) an output hardware coupled to the central processing unit, for 
receiving the three-dimensional representation. 

In a preferred embodiment of the ninth aspect the subset of amino acids 
are the amino acids (i) defining either or both the ligand binding surface(s), or 

25 (ii) defining dimerization interface. 

In a tenth aspect the present invention provides a compound able to 
interact with a member of the EGF receptor family and to modulate an activity 
mediated by the receptor, the compound being obtained by a method according 
to the present invention. 

30 In a preferred embodiment of the tenth aspect, the compound is a mutant 

of the natural ligand of a receptor of the EGF receptor family, where at least 
one mutation occurs in the region of the natural ligand which interacts with the 
receptor. 

In an eleventh aspect the present invention provides a compound which 
35 possesses stereochemical complementarity to a topographic region of a 
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molecule of the EGF receptor family and modulates an activity mediated by the 
molecule, wherein the molecule is characterised by 

(i) amino acids 1-501 of the EGF receptor positioned at atomic 
coordinates as shown in Appendix I, or structural coordinates 
having a root mean square deviation from the backbone atoms of 
said amino acids of not more than 1 .5A; 

(ii) one or more subsets of said amino acids related to the coordinates 
shown in Appendix I by whole body translations and/or rotations, or 

(iii) amino acids present in the amino acid sequence of a member of 
the EGF receptor family, which form an equivalent three- 
dimensional structure to that of the receptor site defined by amino 
acids 1-501 of the EGF receptor positioned at atomic coordinates 
substantially as shown in Appendix I; 

with the proviso that the compound is not a naturally occurring member 
of the EGF receptor family or a mutant thereof. 

By "mutant" we mean a ligand which has been modified by one or more 
point mutations, insertions of amino acids or deletions of amino acids. 

In one embodiment of the eleventh aspect, the topographic region of the 
molecule is defined by is the ligand binding surface defined by amino acids 11- 
18, 20, 22, 26, 29, 30, 45, 69, 89, 90, 98, 99, 101-103, 125, 127 and 128 and/or 
the ligand binding surface defined by amino acids 325, 346, 348-350, 353-358, 
382, 384, 408, 409, 411, 412, 415, 417, 418, 438, 440, 465 and 467 or the 
corresponding regions of a member of the EGF receptor family. 

In another embodiment of the eleventh aspect, the topographic region of 
the EGFR is defined by the dimerization interface defined by amino acids 38, 
86, 193-195, 204, 205, 229, 230, 239, 242-246, 248-253, 262-265, 275, 278- 
280, 282-288, 318. 

In preferred embodiments of the tenth and eleventh aspects, the 
stereochemical complementarity between the compound and the receptor is 
such that the compound has a Kd for the receptor site of less than 10' 6 M, more 
preferably less than 1 0" 8 M. 

In other embodiments of the tenth and eleventh aspects, the compound 
decreases an activity mediated by the EGF receptor. 

In a twelfth aspect, the present invention provides a pharmaceutical 
composition for preventing or treating a disease associated with signaling by a 
molecule of the EGF receptor family which comprises a compound according to 
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the ninth or tenth aspects of the present invention and a pharmaceutical^ 
acceptable carrier or diluent. 

In a thirteenth aspect the present invention provides a method of 
preventing or treating a disease associated with signaling by a molecule of the 

5 EGF receptor family which method comprises administering to a subject in 
need thereof a compound according to the ninth or tenth aspects of the present 
invention. Preferably, the disease is selected from psoriasis and tumour states 
comprising but not restricted to cancer of the breast, brain, colon, prostate, 
ovary, cervix, pancreas, lung, head and neck, and melanoma, 

10 rhabdomyosarcoma, mesothelioma, squamous carcinomas of the skin and 
glioblastoma. 

In a fourteenth aspect, the present invention provides a method for 
evaluating the ability of a chemical entity to bind to EGFR, said method 
comprising the steps of: 
15 (a) creating a computer model of at least one region of EGFR using 

structure coordinates wherein the root mean square deviation 
between said structure coordinates and the structure coordinates of 
amino acids 1-501 of EGFR as set forth in Appendix I or Appendix 
II is not more than about 1 .5 A; 
20 (b) employing computational means to perform a fitting operation 

between the chemical entity and said computer model of the 
binding surface; and 
(c) analysing the results of said fitting operation to quantify the 

association between the chemical entity and the binding surface 
25 model. 

In one embodiment of the fourteenth aspect of the invention the region of 
EGFR is selected from the group consisting of the ligand binding surface 
defined by amino acids 11-18, 20, 22, 26, 29, 30, 45, 69, 89, 90, 98, 99, 101- 
103, 125, 127 and 128 and/or the ligand binding surface defined by amino 
30 acids 325, 346, 348-350, 353-358, 382, 384, 408, 409, 41 1, 412, 415, 417, 
418, 438, 440, 465 and 467 348-350, 353-358, 382, 384, 408, 409, 41 1, 412, 
415, 417, 418, 438 and 465 and a combination thereof. 

In another embodiment of the fourteenth aspect the region of EGFR is 
the dimerization interface defined by amino acids 38, 86, 193-195, 204, 205, 
35 229, 230, 239, 242-246, 248-253, 262-265, 275, 278-280, 282-288 and 31 8. 
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In a fifteenth aspect the present invention consists in a polypeptide 
complex in a crystallized form comprising the amino acids 1-501 of EGFR and 
TGFa. 

It will be appreciated that isolated dimers of compounds comprising 
5 extracellular fragments of members of the EGF receptor family (e.g. dimers of 
fragment 1-501 of EGFR) in the back-to-back configuration may be useful 
therapeutic agents given their ability to compete with natural receptors for 
binding to ligands of the EGF receptor family. 

Accordingly, in a sixteenth aspect the present invention provides a 
10 compound comprising fragment 1 -501 of EGFR or an equivalent fragment of a 
member of the EGF receptor family, wherein the fragment is modified to induce 
dimerisation of the fragment in back-to-back configuration. 

In one embodiment, the modification is made to a residue of the 
fragment which forms part of the back-to-back dimer interface. More 
15 preferably, the modification involves substitution of at least one residue which 
forms part of the back to back dimer with a cysteine residue. The substitution 
may be P248C and/or A265C. Alternatively, the substitution may be D279C. 

In another embodiment of the sixteenth aspect, the modification involves 
insertion of a dimerization sequence into the fragment. A "dimerization" 
sequence allows the non-covalent association of one binding domain to 
another, with sufficient affinity to remain associated under normal physiological 
conditions. 

Suitable dimerization domains that can be used in the context of the 
present invention would be known to those skilled in the art, or may be readily 
identified using standard methods such as the yeast two hybrid system and 
traditional biochemical affinity binding studies. For example, an in vivo library- 
versus-library selection of optimized protein-protein interactions is described in 
Pelletier et al., (1999) Nature Biotechnology 17, 683. 

Suitable dimerization sequences may be derived, for example, from Jun 
and Fos, which are sequence specific DNA binding proteins that regulate 
transcription. Each protein has a bipartite DNA-binding domain consisting of an 
amphipathic helix that mediates dimerization through formation of a short coiled 
structure, termed a "leucine zipper". Suitable dimerization pairs for use in the 
present invention may include the leucine zipper of Jun or Fos and a protein 
sequence that reacts with this leucine zipper. A method for identifying 
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mammalian proteins that react with the leucine zipper of Jun is described in 
Chevray & Nathans, (1992) Proc. Natl. Acad. Sci. USA 89, 5789. 

Suitable dimerization sequences for use in the present invention also 
include: 

5 (i) Heterodimeric coiled-coil peptide pairs as described in Arndt et al., 

(2000) J. Mol. Biol. 295, 627; 

(ii) The WW domain and ligands that bind thereto (see Dalby et al., 
(2000) Prot. Sci. 9, 2366); 

(iii) The bacterial nucleoid-associated proteins H-NS and StpA which 
io form homomeric or heteromeric complexes (see Dorman et al., (1999) Trends 

Microbiol. 7, 124); and 

(iv) Antibody domains, such as the first constant domain (Ch1 and Ci_) of 
an lgG1 (see, for example, Mueller et al., (1998) FEBS Lett 422, 259). 

In one embodiment, the dimerization sequence is inserted between 
15 residues 194 and 195 or between residues 204 and 205 of EGFR or equivalent 

residues of another member of the EGF receptor family. 

In yet another embodiment of the sixteenth aspect, the modification 

involves the lengthening of an appropriate loop structure (e.g. a loop within the 

S1 domain) which may then be cross-linked with the corresponding loop or a 
20 different loop of the dimer partner by a linker. The linker may be, for example, 

a disulphide bond. The lengthening of the loop may be achieved, for example, 

by the insertion of additional residues between residues 210 and 21 1 or 

between residues 297 and 298 of EGFR or the equivalent residues of another 

member of the EGF receptor family. 
25 In another embodiment of the sixteenth aspect, the fragment is 

conjugated to a molecule. The molecule may be, for example, a constant 

domain of an immunoglobulin molecule. 

The present invention also encompasses compounds of the sixteenth 

aspect in dimer form. 

30 The information provided in Appendix I and II also shows that there are a 

number of loop structures in the EGFR. From the three dimensional structure 
antibodies directed against these would interfere with binding of the natural 
ligand to the receptor or with the formation of active dimers. 

Accordingly in a seventeenth aspect the present invention consists in an 

35 antibody which binds to EGFR, the antibody being directed against (i) EGFR 
residues 100-108, 315-327 or 353-362; or (ii) EGFR residues 190-207, 240-305 
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or parts thereof or the corresponding regions of a member of the EGF receptor 
family. 

Antibodies of the present invention may be produced, for example, by 
immunizing mice with purified EGFR fragment 1-501. After determining that 
the mice are producing anti-EGFR antibodies, hybridomas may be prepared 
and antibody specificity assayed by ELISA or Flow Cytometry using two cell 
lines: Baf/wt-EGFR cells and Baf/EGFR-"mutation x" cells. These mouse cell 
lines express either the wild type EGFR or the EGFR containing an Ala 
substitution ( ie mutation x) within the specific site against which the antibody is 
to be directed. When hybridomas secreting antibodies which recognize Baf/wt- 
EGFR, but not Baf/EGFR-"mutant x" are identified, the corresponding 
hybridoma may be cloned and the monoclonal antibody purified. 

Alternatively, in raising antibodies of the invention, it may be desirable to 
use derivatives of the peptides or loop structures which are conformationally 
constrained. Conformational constraint refers to the stability and preferred 
conformation of the three-dimensional shape assumed by a peptide. 
Conformational constraints include local constraints, involving restricting the 
conformational mobility of a single residue in a peptide; regional constraints, 
involving restricting the conformational mobility of a group of residues, which 
residues may form some secondary structural unit; and global constraints, 
involving the entire peptide structure. 

The active conformation of the peptide may be stabilized by a covalent 
modification, such as cyclization or by incorporation of gamma-lactam or other 
types of bridges. For example, side chains can be cyclized to the backbone so 
as create a L-gamma-lactam moiety on each side of the interaction site. See, 
generally, Hruby et al., "Applications of Synthetic Peptides," in Synthetic 
Peptides: A User's Guide: 259-345 (W. H. Freeman & Co. 1992). Cyclization 
also can be achieved, for example, by formation of cystine bridges, coupling of 
amino and carboxy terminal groups of respective terminal amino acids, or 
coupling of the amino group of a Lys residue or a related homolog with a 
carboxy group of Asp, Glu or a related homolog. Coupling of the alpha-amino 
group of a polypeptide with the epsilon-amino group of a lysine residue, using 
iodoacetic anhydride, can be also undertaken. See Wood and Wetzel, 1992, 
Int'l J. Peptide Protein Res. 39: 533-39. 

Further the conformation of the peptide analogues may be stabilised by 
including amino acids modified at the alpha carbon atom (eg. a-amino-150- 
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butyric acid) (Burgess and Leach, 1973, Biopolymers 12(12):2691-2712; 
Burgess and Leach, 1973, Biopolymers 12(11):2599-2605) or amino acids 
which lead to modifications on the peptide nitrogen atom (eg. sarcosine or N- 
methylalanine) (O'Donohue et al, 1995, Protein Sci. 4(10):2191-2202). 

5 Another approach described in US 5,891 ,41 8 is to include a metal-ion 

complexing backbone in the peptide structure. Typically, the preferred metal- 
peptide backbone is based on the requisite number of particular coordinating 
groups required by the coordination sphere of a given complexing metal ion. In 
general, most of the metal ions that may prove useful have a coordination 

10 number of four to six. The nature of the coordinating groups in the peptide 
chain includes nitrogen atoms with amine, amide, imidazole, or guanidino 
functionalities; sulfur atoms of thiols or disulfides; and oxygen atoms of 
hydroxy, phenolic, carbonyl, or carboxyl functionalities. In addition, the peptide 
chain or individual amino acids can be chemically altered to include a 

15 coordinating group, such as for example oxime, hydrazino, sulfhydryl, 

phosphate, cyano, pyridino, piperidino, or morpholino. The peptide construct 
can be either linear or cyclic, however a linear construct is typically preferred. 

As will be readily understood by person skilled in this field the methods of 
the present invention provide a rational method for designing and selecting 

20 compounds including antibodies which interact with members of the EGF 
receptor family. In the majority of cases these compounds will require further 
development in order to increase activity. Such further development is routine 
in this field and will be assisted by the structural information provided in this 
application. It is intended that in particular embodiments the methods of the 

25 present invention includes such further developmental steps. 

In yet a further, eighteenth, aspect, the invention provides a method of 
utilizing molecular replacement to obtain structural information about a 
molecule or a molecular complex of unknown structure, comprising the steps 
of: 

30 (i) crystallising said molecule or molecular complex; 

(ii) generating an X-ray diffraction pattern from said crystallized 
molecule or molecular complex; 

(iii) applying at least a portion of the structure coordinates set forth in 
Appendix I or Appendix II to the X-ray diffraction pattern to generate 

35 a three-dimensional electron density map of at least a portion of the 

molecule or molecular complex whose structure is unknown. 



f 
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The term "molecular replacement" refers to a method that involves 
generating a preliminary model of an EGF receptor family member extracellular 
domain crystal whose structure coordinates are unknown, by orienting and 
positioning a molecule whose structure coordinates are known (e.g., EGFR 1- 

5 501 coordinates from Appendix I or Appendix II) within the unit cell of the 

unknown crystal so as best to account for the observed diffraction pattern of the 
unknown crystal. Phases can then be calculated from this model and 
combined with the observed amplitudes to give an approximate Fourier 
synthesis of the structure whose coordinates are unknown. This, in turn, can 

10 be subject to any of the several forms of refinement to provide a final, accurate 
structure of the unknown crystal (Lattman, 1985, Methods in Enzymology 115: 
55-77; M. G. Rossmann, ed., "The Molecular Replacement Method", Int. Sci. 
Rev. Ser., No. 13, Gordon & Breach, New York, 1972). Using the structure 
coordinates of the EGFR 1-501 provided by this invention, molecular 

15 replacement may be used to determine the structural coordinates of a member 
of the EGF receptor family. 

Throughout this specification, the terms "S1" domain and "cys-rich 1" 
("CR1") domain are used interchangeably. Similarly, the terms "S2" domain 
and "cys-rich 2" ("CR2") domain are used interchangeably. 

20 Throughout this specification, the word "comprise", or variations such as 

"comprises" or "comprising", will be understood to imply the inclusion of a 
stated element, integer or step, or group of elements, integers or steps, but not 
the exclusion of any other element, integer or step, or group of elements, 
integers or steps. 
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Brief Description of the Figures 

Figure 1: Structure-based sequence alignment of the EGFR residues 1-501 
and corresponding residues of ErbB-2, ErbB-3 and ErbB-4. 

5 

Figure 2: Sequence alignment of EGF-like domains of ligands of the EGFR 
family. Note that the start and end of some of these domains are not precisely 
defined. The sequences are for the human forms of the proteins except for 
epigen and the EGF-like domain in neuregulin-4 which are the mouse forms of 

10 the respective proteins. Abbreviations: EGF - epidermal growth factor; TGF-a - 
transforming growth factor alpha; HB-EGF - heparin binding epidermal growth 
factor; NRG - neuregulin. There are four known neuregulin genes (NRG1, 
NRG2, NRG3 and NRG4), some of which encode alternatively spliced forms of 
the EGF-like domain. These forms are identified as the a- or p-form of the 

15 EGF-like domain. 

Figure 3. Polypeptide trace for the structure of the 2:2 complex of sEGFR501 
and TGFa back-to-back dimer, comprising receptor molecule A , receptor 
molecule B, TGFa molecule C and TGFa molecule D. The dimer axis lies 
20 vertically, in the page. 

Figure 4. Structure-based sequence alignment of the human EGFR 
ectodomain, human TGFa and related proteins. (A) The receptor L1 and L2 
domains plus the first module of the cys rich regions, S1 and S2. (B) Modules 

25 2 to 8 of the receptor cys rich region S1 and modules 2 to 7 of S2. (C) Human 
TGFa, EGF and heparin binding EGF. Numbers in parentheses show where 
amino acid have been omitted and positions with conserved physicochemical 
properties of amino acids are boxed. Secondary structure elements are 
indicated above the sequences (and below in A), with shading as in Figure 5A. 

30 Also indicated are disulfide bonds and residues buried at protein-protein 

interfaces: L1-TGFa, 1; L2-TGFa, 2.L1-L2 contacts, 3 in A; L1- & L2-TGFa, 3 in 
B; S1 loop, L; residues to which the S1 loop binds, P; other residues in the 
dimer interface, D. Three types of disulfide bonded modules are indicated by 
bars below the sequences and residues not conforming to the S1 pattern are 

35 shaded grey. 
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Figure 5. Comparison of SEGFR501 with the first three domains of IGF-1R. 
Domains 1-3 of IGF-1R are on the left, sEGFR501 as it appears in the complex 
is on the right. For clarity the ligand in the TGFa:sEGFR501 complex is not 
shown. L1 domains are oriented similarly. 

5 

Figure 6. Structure of the ligand: receptor binding surfaces. 
Ribbon representation showing the contacts between SEGFR501 and TGFa 
viewed from the left in Figure 3. Residue numbers for two important residues in 
TGFa are below the side chains. 

10 

Figure 7. Stereoview of the molecule A S1 loop contacts with S1 of molecule B 
in the back-to-back dimer interface. Inter-chain hydrogen-bonds are drawn in 
black along with the hydrogen-bond from AsnA247 which stabilises the loop tip 
conformation. The single letter code and residue number is used for amino 
15 acid residues. The dimer axis lies vertically at the left between H280. 

Figure 8: Functional characterization of EGFR mutants expressed in BaF/3 
cells. (A) Ligand binding by wild type and mutant EGFRs expressed in BaF/3 
cells. Scatchard plots of 1251-EGF binding to clones expressing the wt, E21 A or 

20 ACR1 EGFR were analyzed using the Radlig program to yield estimates of 
receptor affinity. The three cell lines expressed comparable receptor numbers 
as assessed by M2 or 528 antibody binding and FACS analysis. Shown are the 
plots for cold ligand titration assay; identical results were obtained titrating the 
radiolabeled EGF (hot titration). (B) EGF-dependent tyrosine kinase activation. 

25 This was determined in total cell lysates by sequential immunoblotting with anti- 
phosphotyrosine (top) or anti-EGFR (bottom) antibodies. The anti-EGFR 
antibodies have slightly lower affinity for the hyperphosphorylated form of the 
EGFR. The results are representative of multiple experiments on at least four 
independently derived clones for each mutant. (C) Ligand-induced EGFR 

30 dimerization. Cross-linking of the EGFR via the extracellular portion was 

performed at 37°C to maximize dimer yield. Samples were analyzed by SDS- 
PAGE on 3-8% gradient gels and immunoblotting with anti-EGFR antibodies. 
These data are representative of at least four separate experiments. 
(D) Ligand-induced SEGFR501 dimerization. Cross-linking of wild type and 

35 CR1 loop mutant (Tyr246Asp, Asn247Ala, Thr249Asp, Tyr251 Glu, Gln252Ala 
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and Met253Asp) was carried out as described previously (Elleman et al., 2001. 
Biochemistry 40:8930-8939). 



Kev to Sequence Listing 

5 SEQ ID NO:1 : EGFR as shown in Figure 1 
SEQ ID NO:2: ErbB-2 as shown in Figure 1 
SEQ ID NO:3: ErbB-3 as shown in Figure 1 
SEQ ID NO:4: ErbB-4 as shown in Figure 1 
SEQ ID NO:5: EGF domain as shown in Figure 2 

10 SEQ ID NO:6: TGF-ct domain as shown in Figure 2 

SEQ ID NO:7: Amphiregulin domain as shown in Figure 2 
SEQ ID NO:8: HB-EGF domain as shown in Figure 2 
SEQ ID NO:9: Betacellulin domain as shown in Figure 2 
SEQ ID NO: 10: Epiregulin domain as shown in Figure 2 

15 SEQ ID NO:1 1 : Epigen domain as shown in Figure 2 
SEQ ID NO: 12: NRG1 a domain as shown in Figure 2 
SEQ ID NO: 13: NRG1p domain as shown in Figure 2 
SEQ ID NO: 14: NRG2a domain as shown in Figure 2 
SEQ ID NO: 15: NRG2p domain as shown in Figure 2 

20 SEQ ID NO: 16: NRG3 domain as shown in Figure 2 
SEQ ID NO: 17: NRG4 domain as shown in Figure 2 
SEQ ID NO: 18: EGFR L1 domain as shown in Figure 4A 
SEQ ID NO: 19: IGF 1 R L1 domain as shown in Figure 4A 
SEQ ID NO:20: IGF 1 R L2 domain as shown in Figure 4A 

25 SEQ ID NO:21 : EGFR L2 domain as shown in Figure 4A 
SEQ ID NO:22: EGFR S1 domain as shown in Figure 4B 
SEQ ID NO:23: IGF 1 R S1 domain as shown in Figure 4B 
SEQ ID NO:24: EGFR S2 domain as shown in Figure 4B 
SEQ ID NO:25: TGFa domain as shown in Figure 4C 

30 SEQ ID NO:26: EGF domain as shown in Figure 4C 
SEQ ID NO:27: hbEGF domain as shown in Figure AC 



Detailed description of Preferred Embodiments of the Invention 

The present inventors have now obtained three dimensional structural 
35 information about the EGF receptor which enables a more accurate 

understanding of how the binding of ligand leads to signal transduction. Such 
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information provides a rational basis for the development of ligands for specific 
therapeutic applications, something that heretofore could not have been 
predicted de novo from available sequence data. 

The precise mechanisms underlying the binding of agonists and 
antagonists to the EGF receptor are not fully clarified. However, the binding of 
ligands to the receptor site, preferably with an affinity in the order of 1 0' 8 M or 
higher, is understood to arise from enhanced stereochemical complementarity 
relative to naturally occurring EGF receptor ligands. 

Such stereochemical complementarity, pursuant to the present invention, 
is characteristic of a molecule that matches intra-site surface residues lining the 
groove of the receptor site as enumerated by the coordinates set out in 
Appendix I or Appendix II. Appendix II is a refined version of the coordinates 
provided in Appendix I. 

Substances which are complementary to the shape and electrostatics or 
chemistry of the receptor site characterised by amino acids positioned at 
atomic coordinates set out in Appendix I or Appendix II will be able to bind to 
the receptor, and when the binding is sufficiently strong, substantially prohibit 
binding of the naturally occurring ligands to the site. 

It will be appreciated that it is not necessary that the complementarity 
between ligands and the receptor site extend over all residues lining the groove 
in order to inhibit binding of the natural ligand. 

In general, the design of a molecule possessing stereochemical 
complementarity can be accomplished by means of techniques that optimize, 
chemically and/or geometrically, the "fit" between a molecule and a target 
receptor. Known techniques of this sort are reviewed by Sheridan and 
Venkataraghavan, Acc. Chem Res. 1987 20 322; Goodford, J. Med. Chem. 
1984 27 557; Beddell, Chem. Soc. Reviews 1985, 279; Hoi, Angew. Chem. 

1986 25 767, Verlinde C.L.M.J & Hoi, W.G.J. Structure 1 994, 2, 577, Walters, 
W.P., Stahl, M.T., Murcko, M.A., Drug Discovery Today 1998, 3, 160; Langer, 
T. and Hoffmann, R.D., Current Pharmaceutical Design 2001, 7, 509; Good, A., 
Current Opinion in Drug Disc. Devel. 2001, 5, 301; and Gane, P.J. and Dean, 
P.M., Curr. Opinion Struct. Biol., 2000, 10, 401. the respective contents of 
which are hereby incorporated by reference. See also Blundell et al., Nature 

1987 326 347 (drug development based on information regarding receptor 
structure) and Loughney, DA, Murray, W.V., and Jolliffe, L.K. Med. Chem. 
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Res. 1999, 9, 579 (database mining application on the growth hormone 
receptor). 

There are two preferred approaches to designing a molecule, according 
to the present invention, that complements the stereochemistry of the EGF 
receptor. The first approach is to in silico directly dock molecules from a three- 
dimensional structural database, to the receptor site, using mostly, but not 
exclusively, geometric criteria to assess the goodness-of-fit of a particular 
molecule to the site. In this approach, the number of internal degrees of 
freedom (and the corresponding local minima in the molecular conformation 
space) is reduced by considering only the geometric (hard-sphere) interactions 
of two rigid bodies, where one body (the active site) contains "pockets" or 
"grooves" that form binding sites for the second body (the complementing 
molecule, as ligand). 

This approach is illustrated by Kuntz et al., J. Mol. Biol. 1982 161 269, 
and Ewing, T.J.A. et al., J. Comput-Aid. Mol. Design 2001, 15, 411, the 
contents of which are hereby incorporated by reference, whose algorithm for 
ligand design is implemented in a commercial software package, DOCK 
version 4.0, distributed by the Regents of the University of California and 
further described in a document, provided by the distributor, which is entitled 
"Overview of the DOCK program suite" the contents of which are hereby 
incorporated by reference. Pursuant to the Kuntz algorithm, the shape of the 
cavity represented by the EGF receptor site is defined as a series of 
overlapping spheres of different radii. One or more extant databases of 
crystallographic data, such as the Cambridge Structural Database System 
maintained by Cambridge University (University Chemical Laboratory, Lensfield 
Road, Cambridge CB2 1EW, U.K.) , the Protein Data Bank maintained by the 
Research Collaboratory for Structural Bioinformatics (Rutgers University, N.J., 
U.S.A.), LeadQuest (Tripos Associates, Inc., St. Louis, MO), Available 
Chemicals Directory (Molecular Design Ltd., San Leandro, CA), and the NCI 
database (National Cancer Institute, U.S.A) is then searched for molecules 
which approximate the shape thus defined. 

Molecules identified in this way, on the basis of geometric parameters, 
can then be modified to satisfy criteria associated with chemical 
complementarity, such as hydrogen bonding, ionic interactions and Van der 
Waals interactions. Different scoring functions can be employed to rank and 
select the best molecule from a database. See for example Bohm, H.-J. and 
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Stahl, M. Med.Chem.Res. 1999, 9, 445. The software package FlexX, 
marketed by Tripos Associates, Inc. (St. Louis, MO) is another program that 
can be used in this direct docking approach (see Rarey, M. et al., J. Mol. Biol. 
1996, 261, 470). 

5 The second preferred approach entails an assessment of the interaction 

of respective chemical groups ("probes") with the active site at sample positions 
within and around the site, resulting in an array of energy values from which 
three-dimensional contour surfaces at selected energy levels can be 
generated. The chemical-probe approach to ligand design is described, for 

10 example, by Goodford, J. Med. Chem. 1985 28 849, the contents of which are 
hereby incorporated by reference, and is implemented in several commercial 
software packages, such as GRID (product of Molecular Discovery Ltd., West 
Way House, Elms Parade, Oxford 0X2 9LL, U.K.). Pursuant to this approach, 
the chemical prerequisites for a site-complementing molecule are identified at 

15 the outset, by probing the active site with different chemical probes, e.g., water, 
a methyl group, an amine nitrogen, a carboxyl oxygen, and a hydroxy!. 
Favored sites for interaction between the active site and each probe are thus 
determined, and from the resulting three-dimensional pattern of such sites a 
putative complementary molecule can be generated. This may be done either 

20 by programs that can search three-dimensional databases to identify molecules 
incorporating desired pharmacophore patterns or by programs which using the 
favored sites and probes as input perform de novo design. 

Programs suitable for searching three-dimensional databases to identify 
molecules bearing a desired pharmacophore include: MACCS-3D and ISIS/3D 

25 (Molecular Design Ltd., San Leandro, CA), ChemDBS-3D (Chemical Design 
Ltd., Oxford, U.K.), and Sybyl/3DB Unity (Tripos Associates, Inc., St. Louis, 
MO). 

Programs suitable for pharmacophore selection and design include: 
DISCO (Abbott Laboratories, Abbott Park, IL), Catalyst (Accelrys, San Diego, 
30 CA), and ChemDBS-3D (Chemical Design Ltd., Oxford, U.K). 

Databases of chemical structures are available from a number of 
sources including Cambridge Crystallographic Data Centre (Cambridge, U.K.), 
Molecular Design, Ltd., (San Leandro, CA), Tripos Associates, Inc. (St. Louis, 
MO), and Chemical Abstracts Service (Columbus, OH). 
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De novo design programs include Ludi (Biosym Technologies Inc., San 
Diego, CA), Leapfrog (Tripos Associates, Inc.), Aladdin (Daylight Chemical 
Information Systems, Irvine, CA), and LigBuilder (Peking University, China). 
Those skilled in the art will recognize that the design of a mimetic may 

5 require slight structural alteration or adjustment of a chemical structure 
designed or identified using the methods of the invention. 

The invention may be implemented in hardware or software, or a 
combination of both. However, preferably, the invention is implemented in 
computer programs executing on programmable computers each comprising a 

10 processor, a data storage system (including volatile and non-volatile memory 
and/or storage elements), at least one input device, and at least one output 
device. Program code is applied to input data to perform the functions 
described above and generate output information. The output information is 
applied to one or more output devices, in known fashion. The computer may 

15 be, for example, a personal computer, microcomputer, or workstation of 
conventional design. 

Each program is preferably implemented in a high level procedural or 
object-oriented programming language to communicate with a computer 
system. However, the programs can be implemented in assembly or machine 

20 language, if desired. In any case, the language may be compiled or interpreted 
language. 

Each such computer program is preferably stored on a storage medium 
or device (e.g., ROM or magnetic diskette) readable by a general or special 
purpose programmable computer, for configuring and operating the computer 

25 when the storage media or device is read by the computer to perform the 

procedures described herein. The inventive system may also be considered to 
be implemented as a computer-readable storage medium, configured with a 
computer program, where the storage medium so configured causes a 
computer to operate in a specific and predefined manner to perform the 

30 functions described herein. 

Compounds designed according to the methods of the present invention 
may be assessed by a number of in vitro and in vivo assays of hormone 
function. For example, the identification of EGF receptor antagonists of may be 
undertaken using a solid-phase receptor binding assay. Potential antagonists 

35 may be screened for their ability to inhibit the binding of europium-labelled EGF 
receptor ligands to soluble, recombinant EGF receptor in a microplate-based 
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format. Europium is a lanthanide fluorophore, the presence of which can be 
measured using time-resolved fluorometry. The sensitivity of this assay 
matches that achieved by radioisotopes, measurement is rapid and is 
performed in a microplate format to allow high-sample throughput, and the 
approach is gaining wide acceptance as the method of choice in the 
development of screens for receptor agonists/antagonists ( see Apell et.al. J. 
Biomolec. Screening 3:19-27, 1998 : Inglese et. al. Biochemistry 37:2372-2377, 
1998). 

Binding affinity and inhibitor potency may be measured for candidate 
inhibitors using biosensor technology. 

The EGF receptor antagonists may be tested for their ability to modulate 
receptor activity using a cell-based assay incorporating a stably transfected, 
EGF-responsive reporter gene (Souriau etal., 1997, Nucleic Acids Res. 
25:1585-1590). The assay addresses the ability of EGF to activate the reporter 
gene in the presence of novel ligands. It offers a rapid (results within 6-8 hours 
of hormone exposure), high-throughput (assay can be conducted in a 96-well 
format for automated counting) analysis using an extremely sensitive detection 
system (chemiluminescence). Once candidate compounds have been 
identified, their ability to antagonise signal transduction via the EGF-R can be 
assessed using a number of routine in vitro cellular assays such as inhibition of 
EGF-mediated cell proliferation. Ultimately, the efficiency of antagonist as a 
tumour therapeutic may be tested in vitro in animals beating tumour isografts 
and xenografts as described (Rockwell et al., 1997, Proc Natl Acad Sci U S A 
94:6523-6528; Prewett et al., 1998 Clin Cancer Res 4:2957-2966). 

Tumour growth inhibition assays may be designed around a nude mouse 
xenograft model using a range of cell lines. The effects of the receptor 
antagonists and inhibitors may be tested on the growth of subcutaneous 
tumours. 

EXAMPLES 

Example 1 : Protein preparation of SEGFR501 

The derivation of stably transfected Lec8 cells expressing sEGFR501 
and the subsequent purification and characterisation of the secreted 
ectodomain has been described in detail (Elleman et al., 2001 , Biochemistry 
40:8930-8939.). Purified sEGFR501 was shown, by isoelectric focusing gels to 
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be unstable on storage, the majority of isoforms being transformed into 
products with less acidic isoelectric points. This change was accompanied by a 
small mobility increase (estimated at 1-2 kDa ) on SDS polyacrylamide gels. N- 
terminal sequence analysis showed that the new product retained the 

5 expressed N-terminus of SEGFR501 , suggesting that the apparent 1 -2 kDa 
reduction in mass and increase in positive charge might be due to partial or 
complete loss of the acidic-residue rich C-terminal tag and enterokinase 
cleavage site. Prolonged storage led to the majority of protein converting to the 
least acidic isoform of pi -6.6, which appeared to remain stable. The 

10 conversion of a fresh preparation of SEGFR501 to a stable, less acidic isoform 
was more reproducible and rapid if it was subject to limited proteolysis at 
ambient temperature in Tris-buffered saline (pH8) for -180 min with 
endoproteinase Asp-N (Boehringer-Mannheim) at an enzyme: protein ratio of 
1 :1000 (w/w). The least-acidic isoform of apparent pi -6.2 was isolated from 

15 the other components by anion exchange chromatography. The digest was 
bound to three Uno Q2 columns (BioRad) connected in series to a BioLogic HR 
liquid chromatography instrument in 20 mM ethanolamine /50 mM taurine 
pH8.0 buffer and the least acidic form was the first product obtained by 
isocratic elution in the same buffer containing 15 mM lithium acetate. The 

20 purified protein was incubated with endoglycosidase F (PNGase-free - 
Boehringer Mannheim) at a ratio of 10-20 Units/mg protein, followed by 
rechromatography over Superdex 200 to remove enzyme and low molecular 
weight cleavage products. 

25 Example 2: Crystallization and Data Collection 

SEGFR501 obtained from the above procedures appeared nearly 
homogeneous on SDS and IEF gels and was used in crystallization trials alone 
and in combination with several ligands. The best diffracting crystals were 
obtained from mixtures containing a five-fold molar quantity of human TGFa 

30 (GroPep receptor grade) compared to sEGFR501 . Crystals of SEGFR501 in 
complex with TGFa were grown in 7% PEG 3350, 20% Trehalose, 10 mM 
CdCI2 and 100mM HEPES, pH 7.5, and belonged to the space group P21 (a = 
51.59, b = 198.71, c = 78.90 A, p = 102.03°). These crystals were cryo-cooled 
to -170°C in the same mother liquor. Data were recorded on a Rigaku RAXIS 

35 VI area detector using a Siemens M18XHF X-ray generator with Yale/MSC 
mirrors or a Rigaku RU300 generator and AXCO capillary optics. Crystals 
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were also derivatised by soaking in mother liquor containing 1-10 mM heavy 
atom compounds and diffractions data were collected as before and statistics 
are given in Table 1 . The resolution limit was defined as where l/a = 2 for 50% 
of the reflections. Notable anisotropy was observed for the diffraction limit of 
5 the crystals and in the mosaic spread of diffraction maxima. 

Example 3: Phase Determination and Structure Refinement 

Phasing by multiple isomorphic replacement was performed with 
programs from CCP4 (Collaborative Computational Project Number 4, 1994) 

10 and SHARP (De La Fortelle and Bricogne, 1996, , Methods Enzymol. 276: 472- 
494) and the resulting electron density maps were improved by solvent 
flattening and histogram matching with DM (Cowtan, K. 1994, Joint CCP4 and 
ESF-EACBM Newslett. Protein Crystallogr. 31:34-38). Details are given in 
Table 1. Density averaging using noncrystallographic symmetry was not of 

15 much value as the proteins corresponded to more than three rigid groups. The 
polypeptide chains for two receptor and two ligand molecules were fitted 
manually and refined with CNS (Brunger, et al., 1998, X-PLOR Reference 
Manual 3.851, Yale Univ., New Haven, CT). As the highest resolution data 
were collected for the PIP derivative these data were use for the final stages of 

20 refinement. During the refinement an overall anisotropic temperature factor 
was applied, with the magnitude of the semi-axes being -18.4, 5.6 and 12.7 A 2 . 
The refined structure contains 1097 amino acids, 14 carbohydrate residues, 7 
Pt 2+ , 1 1 Cd 2+ and 4 CI" ions and 79 water molecules. Poor density was 
observed for residues 148-160 and 289-307 in each receptor and no density 

25 was found for ligand residues C1 and D1 -D2 and receptor residues A306 and 
beyond residues A500 and B501 . 

Example 4: Construction of N-terminal tagged EGF receptor and mutants 

The polymerase chain reaction (PCR) using a human EGFR cDNA 
30 (Accession # x00588) (Ullrich et al., 1984, Nature 309:41 8-425) was used to 
generate EGFR expression constructs. It is noted that the original EGFR cDNA 
sequence contains an error at position 1806G (Accession # x00588). The 
correct base is 1806C, which destroys the Hind III restriction site in the original 
cDNA sequence. To construct the FLAG tag at the N-terminus of the receptor, 
35 PCR products containing EGFR leader sequence (and small portion of 5' non- 
coding sequence, base pair 131 to 261), followed by the FLAG coding 
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sequences with Hind ill and Xho I on its 5' and 3' ends, respectively, were 
generated and cloned into a mammalian expression vector pcDNA3 
(Invitrogen) using those restriction sites. The Xho I site coding for Leu and Glu 
of mature EGFR residues 1 and 2 was generated by silent mutation and an 

5 Xba I site was generated after the stop codon (381 7-381 9) of EGFR cDNA 
using PCR. Cloning such modified EGFR cDNA into the FLAG tag containing 
pcDNA3 vector yielded the wild-type N-terminus tagged EGF receptor 
construct, M2-EGFR. PCR products containing point mutations and S1-loop 
deletion were cloned using the wild-type M2-EGFR as a template. The point 

10 mutation constructs are E21A, R470L, N473D, S474E and A477D. The S1- 
loop deletion construct contains a replacement of nucleotides 988-1035 by 
GCC, resulting in S1-loop residues 244-259 being replaced by a single alanine 
residue. The SEGFR501 S1-loop mutant (Tyr246Asp, Asn247Ala, Thr249Asp, 
Tyr251Glu, Gln252Ala and Met253Asp) was generated by oligonucleotide- 

15 directed in vitro mutagenesis using the USB-T7 Gen kit, transiently expressed, 
purified and characterised as described previously (Elleman et al., 2001 . 
Biochemistry 40:8930-8939). 

Example 5: Transient expression of wild-type and mutant EFGR 

20 NIH3T3 and 293 cells were obtained from the American Type Culture 

Collection. The cells were grown in a 10% C0 2 atmosphere at 37 °C in 
Dulbecco's modified Eagle's medium (for NIH3T3) or in RPMI medium (for 293) 
(both from Life Technologies. Inc.) containing 10% foetal bovine serum (CSL, 
Australia), 60 |xg/ml pencillin and 100 ng/ml streptomycin. Transient 

25 transfections were performed using FuGENETM 6 (Roche Molecular 

Biochemicals) according to manufacture's protocol. Cells were seeded at 
-10% (for NIH3T3) or -25% (for 293) confluency in 6-well plate and transfected 
with 0.5 \ig plasmid DNA per construct per well. Transfected cells were 
assayed two days later. For western blotting, cells were washed with serum- 

30 free medium, starved for 2 hr and treated with or without EGF (1 OOng/ml) for 1 0 
min. Whole cell lysates were prepared, fractionated by SDS-gel 
electrophoresis using 4-20% polyacrylamide gels and western blotted using the 
monoclonal antibodies M2 (anti-FLAG, Sigma) and 4G10 (anti- 
phosphotyrosine, Upstate Biotechnology) as described (Walker et al, 1998, 

35 Growth Factors 16, 53-67). 
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Example 6: Characterisation of wild-type and mutant EGFR stably 
expressed in BaF/3 cells. 

The isolation and characterisation of stably transfected cell lines 
expressing wild-type and mutant EGFRs was performed using the 113- 

5 dependent murine hemopoietic lineage BaF/3 (Walker et al, 1 998, Growth 
Factors 16, 53-67). Expression vectors containing the appropriate EGFR 
constructs were transfected individually by electroporation using a Gene Pulser 
(BioRad) according to manufacturer's instructions. Neomycin-resistant pools 
were generated by selection in G418, and cloned by limiting dilution to obtain 

10 stable cell lines. Cell-surface expression of receptors was detected by 

FACScan (Fluorescence Activated Cell Scan, Becton and Dickinson) using the 
anti-EGFR monoclonal antibody 528 (Gill et aL, 1984, J. Biol. Chem. 259:7755- 
7760) and the M2 anti-FLAG antibody (Brizzard et al., 1994, Biotechniques 
16:730-735). Ligand binding studies and Scatchard analysis were performed 

15 using iodinated murine EGF as previously described (Walker et al, 1998, 
Growth Factors 16, 53-67). Scatchard plots and estimates of affinities and 
receptor numbers were obtained using the Radlig program (Kell for Windows, 
BioSoft). Ligand-induced receptor kinase activation was analysed by 
immunoblotting cell lysates with 4G10. For receptor cross-linking studies, 

20 washed cells were incubated in PBS with or without EGF (1 OOng/ml) and with 
or without BS3 (Pierce; 1 .3mM) for 20 min at 37°C. The cells were then lysed 
and analysed by immunoblotting using a polyclonal sheep anti-EGFR antibody 
(Upstate Biotechnology) as described (Walker et al.,1998. Mol. Cell Biol. 
18:7192-7204). 

25 

Example 7: Overall Structure 

SEGFR501 is comprised of three structural domains, namely L1, S1 and 
L2 plus the first module from the second cys-rich region S2. Crystals of 
TGFa:sEGFR501 contain two molecules of each polypeptide in the asymmetric 

30 unit. There are two possible dimer interactions: a back-to-back dimer 

dominated by interactions between the S1 domains of each receptor and a 
head-to-head dimer involving contacts between the L1 and L2 domains. The 
back-to-back complex is approximately 33 x 78 x 1 03 A while the head-to-head 
complex is 65 x 75 x 128 A. Each TGFa molecule is clamped between the L1 

35 and L2 domains from the same SEGFR501 molecule, and makes contact with 
only one receptor molecule in the dimer. In the back-to-back dimer the two 
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ligands are located on opposite sides of the complex with the closest approach 
70.9 A apart. In the head-to-head dimer the two ligands are centrally located, 
and are separated by 1 5 A. 

We conclude that the back-to-back dimer corresponds to the 2:2 

5 TGFa:sEGFR501 complex that is formed in solution (Elleman et al., 2001 . 
Biochemistry 40:8930-8939) from comparisons of the amount of buried surface 
area in the two dimer options, the lack of symmetry in the head-to-head dimer 
compared to that seen in the back-to-back dimer, the sequence conservation at 
the dimer interfaces (described later) and the characteristics of the receptors 

10 mutated at both interfaces (described later). In the head-to-head dimer only 
510 A2 of accessible surface area is buried on each molecule and this is 
distributed over two patches 39 A apart. The residues involved are 21, 24, 25, 
28 and 48-51 on both Lis, 471, 473, 474, 476 and 477 on both L2s plus 32 
(molecule A) and 443 and 478 from molecule B. In contrast, in the back-to- 

15 back dimer 1 1 25 A2 on each receptor is buried. Biologically relevant protein- 
protein interfaces usually bury more than 700 A2 of surface per molecule and 
often about 1000 A2 (Lo Conte et al., 1999, J. Mol. Biol. 285:2177-2198), 
implying that the back-to-back configuration is more likely to be the functional 
dimer. There is a lack of symmetry at the two L1-L2' interfaces in the head-to- 

20 head dimer which corresponds to a 6 A translation of the L2' helix (residues 
471-479) relative to the L1 helix. Such structural ambiguity is not seen in the 
back-to-back dimer (Figure 3), the non-crystallographic symmetry being very 
close to a pure two-fold rotation, implying that this is the functional dimer. It is 
further supported by experiments where a model of the EGF receptor S2 

25 domain (Jorissen et al., 2000, Protein Sci. 9:31 0-324) was superimposed onto 
the structure determined here for the first modules of the S2 domains of the two 
SEGFR501 molecules. In the back-to-back dimer the rod-like domains of S2 
project towards each other underneath sEGF501 , consistent with the ability to 
form disulfide-linked dimers via a Cys mutation three residues upstream of the 

30 transmembrane domain when ligand binds to mutant receptors (Sorokin et al., 
1994, J. Biol. Chem. 269:9752-9759). The same superimposition performed on 
the head-to-head dimer results in the modelled S2 domains projecting away 
from each other and is inconsistent with the Cys mutant data (Sorokin et al., 
1994, J. Biol. Chem. 269:9752-9759). 
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Example 8: Receptor domain architecture 

The L1, S1 and L2 domains show both sequence (Figure 4) and 
structural (Figure 5) homology to the first three domains of the type I insulin-like 
growth factor receptor (Garrett et al., 1998, Nature 394:395-399). More 

5 broadly, the L domains resemble other leucine-rich repeat or solenoid proteins 
(Ward, C. W. and Garrett, T. P. J. 2001, BMC Bioinformatics 2, 4 ; Kobe B. and 
Kajava, A. V. 2001 , Curr. Opin. Struct. Biol. 1 1:725-732). Each L domain is 
composed of six turns of a p-helix or solenoid and is capped at each end by a 
helix and a disulfide bond. At the C-terminus of the L domains the helix is only 

10 vestigial and in each case there is intimate association with the first module of 
S1 or S2. A conserved Trp from each of these first modules (Trp176 in S1 and 
Trp492 in S2) is inserted into the body of the L domain between the fourth and 
fifth turns of the p-helix as seen in IGF-1 R (Garrett et al., 1998, Nature 
394:395-399 ), making these modules structurally part of the L domain. In each 

15 case the loops in the first cys-rich modules of the S1 and S2 domains of 
SEGFR501 are shorter than those in IGF-1 R and similar in size to the other 
modules in SEGFR501 (modules 2 and 3 in S1 and 4 and 7 in S2) which 
contain two disulfide bonds (Figures 4A and 4B). 

Each of the L domains contains a large p-sheet (second sheet, in Figure 

20 5), flanked by two shorter ones on either side (blue and yellow). The edge 
between the first and second p-sheets is characterised by the presence of a 
stack of conserved Gly residues at positions 39, 63, 85, 122 in L1 and 343, 
379, 404 and 435 in L2 (Figure 4A). The edge at the junction of the second 
and third p-sheets is formed, in part, by a short Asn ladder as in IGF-1 R 

25 (Garrett et al. , 1 998, Nature 394:395-399). A loop from the fourth turn of each 
solenoid protrudes from the large (second) p-sheet and is common to the EGF 
and IGF receptor families. Opposite the large p-sheet in both L1 and L2 there 
is a more irregular face, with the polypeptide strands in the third, fourth and fifth 
turns in L2 having a similar conformation to those in IGF-1 R L1 but different 

30 from those in EGFR L1. 

For both L1 and L2 domains of EGFR the long p-strand in the first turn of 
the solenoid is missing. In L1 this strand is replaced by a long V-shaped 
excursion (residues 8-18) of the polypeptide chain which sits over the large p- 
sheet of this domain to form a major part of L1's ligand-binding surface (Figure 

35 6). In L2 this second strand is replaced by a loop (residues 316-326) which 
also contacts the ligand (Figure 6). 
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The order and association of the eight disulfide-bonded modules in S1 
are similar to that of IGF-1 R (Figures 4A and 4B), with the first module packed 
against the fourth face of the L1 domain as discussed above and modules 2-8 
forming a rod-like domain (Figure 5) spanning from L1 to L2. Relative to IGF- 

5 1 R, each of the disulfide bonded modules in SEGFR501 is oriented slightly 
differently to the previous one (8-36°), with the cumulative effect being that S1 
of the EGFR appears as a straight rod, bent at module 6, whereas in IGF-1 R 
the S domain is curved. Even for the two molecules of EGFR in the crystal's 
asymmetric unit there is a relative difference between modules 6 and 7 of 12°, 

10 implying that the modules are not always rigidly associated. 

Like IGF-1 R, S1 of EGFR makes contact with L1 along one side of the 
solenoid (sheet 1, burying 1375 A 2 of accessible surface area) but in EGFR, S1 
also makes appreciable contact with the L2 domain via modules 6 and 7 
(burying 860 A 2 ). This is different to the IGF-1 R structure where the L2 domain 

15 is rotated away to lie almost perpendicular to the axis of L1 (Figure 5). Thus 
the C-terminal region of S1 may act as a hinge in the ligand-free form of the 
EGFR as modules 7 and 8 appear somewhat mobile, having some of the 
largest temperature factors in the structure. 

The most striking feature of S1 is a large ordered loop from module 5 

20 which projects directly away from the ligand-binding site. The loop consists of 
residues 242-259 and contains an antiparallel p-ribbon (Figure 5). This loop is 
highly conserved within the EGFR family and is different to the insulin receptor 
family where a loop of similar size points from module 6 into the ligand-binding 
site (Figure 5). If EGFR were to have a loop similar to IGF-1 R, there would be 

25 a substantial steric clash between that loop and L2. 

Example 9: Structure of TGFct 

More than 10 mitogenic peptides form a family of ligands which can bind 
to members of the EGFR family. However, apart from residues Gly19, Gly40 

30 and the three conserved disulfide bonds which are needed to maintain 
structure, only Arg42 is conserved throughout the family and pairwise 
sequence identities between the ligands are often less than 35%. Three- 
dimensional structures have been determined by NMR for EGF (Montelion et 
al., 1987, Proc. Natl Acad. Sci. USA. 84, 5226-5230; Cooke et al., 1987, 

35 Nature 327:339-341; Kohda et al., 1992, Biochemistry 31:11928-11939; 
Barnham, et al.,1998, Protein Sci. 7:1738-1749), TGFa (Tappin et al., 1989, 
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562; Moy et al.,1993, Biochemistry 32:7334-7353) and heregulin (Nagata et al, 
1994, EMBO J. 13:3517-3523; Jacobsen et al, 1996, Biochemistry 35, 3402- 
341 7) and by X-ray crystallography for heparin-binding EGF (HB-EGF) in 
5 complex with diphtheria toxin (Louie et al., 1997, Mol. Cell 1:67-78) and EGF 
(Lu, et al., 2001, J. Biol. Chem. 276:34913-34917). These structures show that 
TGFa and its relatives are relatively flexible molecules built on a small 
structurally conserved core. In particular, the N- and C-terminal residues are 
often quite disordered. From a comparison of the two molecules of EGF in the 
10 asymmetric unit, (Lu, et al., 2001, J. Biol. Chem. 276:34913-34917) found that 
the common structural core comprised only residues 13-21 and 30-47 
(equivalent to 15-22 and 31-48 in TGFa, Figure 4C) which encompassed half of 
the large p-ribbon and a small, C-terminal p-ribbon. The structure of TGFa, 
seen here in the complex, shows substantially more order, with a third, Ni- 
ls terminal p-strand (residues 4-6) aligned with the large p-ribbon (residues 19-33) 
to form a three-stranded p-sheet and an ordered C-terminus. The structure of 
TGFa in the 2:2 complex is triangular or crescent shaped. The two TGFa 
molecules in the dimer superimpose well on each other (rmsd 0.70 for 44 Ca 
atoms). They are structurally similar to the human EGF molecule A (rmsd 1 .33 
20 A for 41 Ca atoms) in the EGF crystal structure (Lu, et al., 2001 , J. Biol. Chem. 
276:3491 3-3491 7) and even more closely to HB-EGF (0.66 A for 34 Ca atoms) 
in its complex with diphtheria toxin (Louie et al., 1997, Mol. Cell 1:67-78). 

Example 10: Liqand-receptor interactions in the EGF receptor 

25 In the complex, each SEGFR501 monomer interacts with a single TGFa 

molecule and each ligand interacts with the large p-sheets of both the L1 and 
L2 domains of one receptor molecule (Figures 3 and 6). Relative to IGF-1 R, 
the position of L2 corresponds to a rotation by 105° at the L2/S1 module7 
interface or 122-130°, relative to L1 of IGF-1 R. More than a third of the ligand's 

30 accessible surface area is buried by the L1 and L2 domains of the receptor 
(about 745 A 2 by L1 and about 785 A 2 by L2) and over 60% of the ligand's 
residues make contact with the receptor. The footprint of the ligand on the 
receptor covers most of the large (second) sheet of each L domain, running 
from the top left corner to abut the loop in the fourth rung of the solenoid 

35 (Figures 3 and 6). 
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In the contact with L1 , the inner curved face of the crescent-shaped 
TGFa sits across the large sheet and extends to the N-terminal helix of L1 
(Figure 6). More than half the buried surface area of L1 comes from a V- 
shaped loop which runs across the large sheet, replacing the first strand of the 

5 corresponding sheet in IGF-1 R. In the center of this interface TGFa makes 
contact with the receptor, primarily via main chain atoms. One strand from the 
large p-sheet of TGFa (residues 29-35) sits edge on to the receptor and aligns 
with the latter part of the V-shaped loop (residues 1 5-17) in L1 's first solenoid 
turn. This enables the receptor to contribute part of the V as a fourth parallel p- 

10 strand to the first and larger of the ligand's two p-sheets (Figure 6). Asn1 2, 
which is conserved in all of the EGFR family except ErbB2, makes a side chain 
to main chain contact with the peptide N atom of Gly40 in TGFa. The O7I 
atom of Thr15 from L1 also makes a hydrogen bond to Ala41 O of TGFa. This 
interface is also characterized by a small hydrophobic contact around Leu17 

15 from L1 and hydrophilic and electrostatic interactions involving the ligand's 'B 
loop' residues Arg22, Gln26, Glu27 and Lys29 with the L1 domain residues 
Tyr45, Tyr101, Arg125, and Glu90 respectively. The location of the N-terminus 
of TGFa near Tyr101 in the complex is consistent with the chemical cross- 
linking data of (Woltjer et al., 1992, Proc. Natl. Acad. Sci. USA. 89, 7801-7805). 

20 It should be noted that the lack of conservation in ErbB2 of two key residues in 
this interface (Arg for Thr/Ser at position 1 5 and Met for Asn at position 12) 
would prevent any of the EGF family of ligands from binding to L1 . 

The interface between L2 and TGFa is formed mostly from the side chain 
atoms of both the ligand and receptor. TGFa sits on the flat face (i.e. the large 

25 p-sheet) of L2, surrounded by three loops (residues 316-326, 352-363 and 405- 
412) which project out from the plane of the sheet (Figure 6). The contact 
between the ligand and receptor is an alternating series of stripes of 
hydrophobic and hydrophilic interaction across the interface. These are as 
follows: (i) Phe15 of TGFa sits against Phe357 of EGFR; (ii) the strictly 

30 conserved Arg42 of TGFa is sandwiched between Phe1 5 and Phe1 7 of the 
ligand facilitating the correct orientation and environment to make a salt bridge 
with the strictly conserved Asp355 of the receptor; (iii) Phe 17 and the lower 
part of Glu44 from TGFa interact with Leu325, Leu348 and Val350 from L2; (iv) 
the next hydrophilic region contains four histidines, His18 and His45 of TGFa 

35 and His346 and His409 of L2, as well as Tyr38 and Glu44 from TGFa and 
Gln384 and Gln408 from L2; and (v) there is a hydrophobic pocket in L2 
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(Leu382, Gln408, His409, Phe412, Val 417, Ile438), centred over Ala415, 
which holds the highly conserved Leu48 of TGFa (Leu47 in EGF), the ligand 
residue with the largest buried surface The C-terminus of TGFa is sandwiched 
between domains L1 and L2, with the side chain of Leu49 contacting both L 

5 domains. Leu49 may well define the final positioning of the L domains in the 
complex. Lys465 from L2 is near the C-terminus of TGFa and may stabilise the 
terminal carboxyl group. Lys465 has been chemically cross-linked to residue 
45 in a mutant form of mouse EGF (Summerfield et al.,1996, J. Biol. Chem. 
271:19656-19659). Some carbohydrate nearby could possibly also affect ligand 

10 binding. 

There appears to be a number of key contacts, with the ionic interaction 
between TGFa Arg42 and EGFR Asp355 and the hydrophobic interaction 
between TGFa Leu48 and the hydrophobic pocket centred over EGFR Ala415 
being particularly important. These features are conserved in all ErbB family 
15 members. 

Although the interactions of EGFR with TGFa are ostensibly the same for 
both molecules in the crystal's asymmetric unit, it should be noted that when 
the ligands are superimposed, the L1 domains differ by a rotation. of 3.5° about 
Leu14 Cy and for the L2 domain approximately 8° about Ala415 Cp in EGFR 
20 and the side chain of Leu48 in TGFa. These observations suggest that while 
there may be a bit more flexibility in the TGFa:L2 interface, Leu48 is the major 
determinant of ligand binding to L2. The cluster of His residues in the middle 
of the L2 interface may play a part in release of the ligand at low pH following 
endocytosis. 

25 

Example 1 1 : Receptor-receptor interactions 

Unlike other growth factor receptor complexes, the ligand is not found at 
the dimer interface in the 2:2 complex of TGFa:sEGFR501 . Thus ligand 
induced dimerization of sEGFR501 implies that binding of ligand induces a 

30 conformational change in the receptor that promotes receptor-receptor 

interactions. The most notable feature of the back-to-back dimer is a long loop 
(residues 242-259) which is specific to the EGFR family and is not found in the 
CR of IGF-1 R (Figures 4B and 5) or other members of the insulin receptor 
family. From each receptor the loop projects out from the fifth module of S1 , 

35 across the other S1 domain to a space between L1 , L2 and S1 domains of the 
neighbouring receptor (Figure 3). Contact is made by residues 244-253 of the 
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S1 loop in, say, molecule A with residues 229-239, 262-278, and 282-288 on 
the concave face of the S1 domain of molecule B (Figure 3). The buried 
surface areas are 480 A 2 and 330 A 2 , respectively. At specific positions in the 
S1 loop there is remarkable sequence conservation across all ErbB family 

5 members. Tyr246 is strictly conserved and is completely buried in the interface. 
The On atom of TyrA246 (receptor molecule A) makes hydrogen bonds with 
the GlyB264 N and CysB283 0 atoms (receptor molecule B) and the phenyl 
ring sits against the Cp atoms of SerB262 and SerB282 and the face of the 
following peptides (Figure 7). Residue 251 is strictly conserved as Tyr or Phe 
10 and in this interface makes a hydrophobic contact via the benzene ring with the 
PheB263, GlyB264, TyrB275 and ArgB285. The Oil of TyrA251 is exposed to 
solvent. Additional hydrophobic contacts are made by ProA248 to PheB230 
and AlaB265; and by MetA253 to ThrB278. There is also a hydrogen bond from 
TyrA251 O to ArgB285 N (Figure 7). 

15 Other conserved residues of the S1 loop, such as Asn247 and Asn256, 

do not make contact with the other half of the dimer, but hydrogen bond back 
onto the main chain and appear to be important for maintaining the loop in the 
appropriate conformation. There are four positions in the loop (residues 243, 
248, 255 and 257) where proline is found in at least one member of the human 

20 EGFR family with ErbB3 having as many as three prolines. These prolines 
would further stabilise the conformation of the loop. 

The loop not only touches the S1 domain of its partner, but also reaches 
across to contact the L1 and L2 domains of the other receptor molecule 
(burying a surface area of 40 A 2 on L1 and 5 A 2 on L2). AsnB86 touches 

25 ThrA249 and, with a slight rearrangement, could form a hydrogen bond 

between the side chains. Neither residue is conserved in other ErbB receptors 
although polar residues predominate at these positions. ThrA250, which is 
conserved in other ErbB receptors, sits near HeB318 but the reason for the 
conservation is not apparent. Although these interactions are quite weak, it is 

30 possible that the binding of the loop from one receptor may be affected by 
binding of ligand to the other, as ligand binding may alter the relative positions 
of the L domains. 

Two other regions also participate in the back-to-back dimer contact. 
One is near the two long loops, where Asp279 and His280 of receptor A make 

35 contact across the dimer axis with the corresponding residues from receptor B 
(Figure 3). A second region of contact is near the N-terminal end of the S1 
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domain in cys-rich module 2, where residues 193-195 and 204-205 from 
molecule A contact 1 93-194 and 204-205 from molecule B, burying about 225 
A 2 

5 Example 12: Functional Characterisation of mutant EGF Rs expressed in 
BaF/3 cells 

In order to establish the biological relevance of the two dimers identified 
in crystals of the TGFa:sEGFR501 complex, mutant receptors designed to 
probe the two dimer interfaces were analyzed. Single amino acid substitutions 

10 Glu21Ala, Arg470Leu, Asn473Asp, Ser474Glu and Ala477Asp were prepared 
to test the head-to-head dimer. When transiently expressed in 293 cells, which 
express low endogenous levels of EGFR (<1x10 4 receptors/cell), or when 
stably expressed (Glu21Ala) in the hemopoietic cell line BaF/3 which do not 
express EGFR family members (Walker et al., 1998, Growth Factors 16:53-67), 

15 these mutants showed normal EGF binding, kinase activation, dimerization 
(Figure 8) and internalization (data not shown). In contrast mutants of the 
back-to-back dimer, an S1 loop deletion (residues A242-259) from the full 
length receptor and sEGFR501 with multiple substitutions in the S1 loop 
(Tyr246Asp, Asn247Ala, Thr249Asp, Tyr251Glu, Gln252Ala and Met253Asp) 

20 were defective. The AS1-loop clones fail to show ligand-induced dimerization 
and ligand-induced kinase activation and exhibit only low affinity binding 
(Figure 8A, B, C). The SEGFR501 mutants fail to show ligand-induced 
dimerization (Figure 8D) and exhibit 15 fold lower affinity binding on BIAcore 
(500 nM vs 30 nM for sEGFR501). 

25 

Conclusion 

Ligand-induced dimerisation (or oligomerisation) of receptors is a 
common means of signal transduction and in all cases seen so far the ligand 
participates directly in the dimerisation of receptors. For VEGF/Flt-1 

30 (Wiesmann et al.,1 997, Cell 91 :695-704), nerve growth factor (NGF)/TrkA 
receptor (Weismann et al.,1999, Nature 401:184-188.), bone morphogenic 
protein (BMP)/BMP receptor (Kirsch et al., 2000, Nat. Struct. Biol. 7:492-496), 
interferon y(IFNy)/IFNy receptor (Thiel et al., 2000, Structure Fold Des. 8:927- 
936) and tumour necrosis factor (TNF)/TNF receptor (Banner et al., 1993, Cell 

35 73:431 -445), the ligand is a dimer or trimer before forming the 2:2 complex or 
3:3 complex, and in the structures determined, the receptors do not contact 
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each other. In the 2:2 complex of the fibroblast growth factor (FGF)/FGF 
receptor the ligands do not contact each other but are dimerised by heparin 
(Plotnikov et al.,2000, Cell 101:413-424; Schlessinger et a!., 2000, Molecular 
Cell 6:743-750; Sorokin et al., 1994 J. Biol .Chem. 269:9752-9759; Pelligrini et 

5 al., 2000, Nature 407:1029-1034). The FGF receptors do contact each other 
and the two FGF ligands lie at the dimer interface with a heparin molecule 
sitting between two FGFs. In the 2:2 complex of granulocyte colony stimulating 
factor (GCSF)/GCSF receptor (Aritomi et al., 1 999 , Nature 401 .713-715) each 
ligand binds both receptors but there are no contacts between the two ligands 

10 or the two receptor fragments. Finally, in the growth hormone, erythropoietin 
and prolactin/receptor complexes, there is only one ligand molecule in the 1 :2 
complex and the two receptor molecules make contact with ligand and with 
each other (de Vos et al.,1992, Science 255:306-312). 

The TGFa:EGFR complex represents a new and surprising way in which 

15 receptors and protein ligands interact. EGFR ligands bind at a site remote from 
the dimer interface and must modify the receptor to promote dimerisation. A 
precedent for this has been seen for much smaller ligands. For example, in the 
rat metabotrophic glutamate receptor, a disulfide-linked homodimer, binds 
glutamate between two domains of the receptor monomer, causing them to go 

20 from an 'open' to a 'closed' form (Kunishima et al.,2000, Nature 407:971-977). 
Such a mechanism could also occur in the EGFR family where the ligand binds 
both L1 and L2, fixing the relative orientations of the two domains. Compared 
to IGF-1 R there is a substantial rearrangement of L domains in EGFR (Figure 
5) although a conformational change of such a magnitude would not be 

25 necessary. A smaller change in L domain positions upon ligand binding, 

possibly with hinge motions seen at the S1 module 5/6, 6/7 and 7/L2 interfaces 
(relative to IGF-1 R), could enable EGFR extracellular domains to form dimers. 

The disclosure of all publications referred to in this application are 
include herein by reference. 

30 It will be appreciated by persons skilled in the art that numerous 

variations and/or modifications may be made to the invention as shown in the 
specific embodiments without departing from the spirit or scope of the invention 
as broadly described. The present embodiments are, therefore, to be 
considered in all respects as illustrative and not restrictive. 
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Claims: 

1 . A method of selecting or designing a compound that interacts with a 
receptor of the EGF receptor family and modulates an activity associated with 
the receptor, the method comprising 

(a) assessing the stereochemical complementarity between the 
compound and a topographic region of the receptor, wherein the receptor 
comprises: 

(i) amino acids 1-501 of the EGF receptor positioned at atomic 
coordinates as shown in Appendix I or Appendix II, or structural 
coordinates having a root mean square deviation from the 
backbone atoms of said amino acids of not more than 1 .5A; 

(ii) one or more subsets of said amino acids related to the coordinates 
shown in Appendix I or Appendix II by whole body translations 
and/or rotations; or 

(iii) amino acids present in the amino acid sequence of a receptor of 
the EGF receptor family, which form an equivalent three- 
dimensional structure to that of amino acids 1-501 of the EGF 
receptor positioned at atomic coordinates substantially as shown in 
Appendix I or Appendix II, or structural coordinates having a root 
mean square deviation from the backbone atoms of said amino 
acids of not more than 1 .5A, or one or more subsets thereof, 

(b) obtaining a compound which possesses stereochemical 
complementarity to a topographic region of the receptor; and 

(c) testing the compound for its ability to modulate an activity 
associated with the receptor. 

2. A method as claimed in claim 1 wherein the receptor is EGFR and the 
topographic region of EGFR is the ligand binding surface defined by amino 
acids 11-18, 20, 22, 26, 29, 30, 45, 69, 89, 90, 98, 99, 101-103, 125, 127 and 
128, and/or the ligand binding surface defined by amino acids 325, 346, 348- 
350, 353-358, 382, 384, 408, 409, 411, 412, 415, 417, 418, 438, 440, 465 and 
467. 

3. A method as claimed in claim 2 wherein the compound is selected or 
designed to have portions that match residues positioned on the ligand binding 
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surface of EGFR defined by amino acids 11-18, 20, 22, 26, 29, 30, 45, 69, 89, 
90, 98, 99, 101-103, 125, 127 and 128, and/or the ligand binding surface of 
EGFR defined by amino acids 325, 346, 348-350, 353-358, 382, 384, 408, 409, 
411, 412, 415, 417, 418, 438, 440, 465 and 467. 

5 

4. A method as claimed in claim 1 wherein the receptor is ErbB-2 and the 
topographic region of ErbB2 is the surface defined by amino acids 9-16, 18, 20, 
24, 27, 28, 43, 67, 87, 88, 96, 97, 99-101 , 133, 135 and 136, and/or the surface 
defined by amino acids 333, 354, 359-358, 361-366, 390, 392, 416, 417, 419, 

10 420, 423, 425, 426, 446, 448, 473 and 475. 

5. A method as claimed in claim 4 wherein the compound is selected or 
designed to have portions that match residues positioned on the surface of 
ErbB2 defined by amino acids 9-16, 18, 20, 24, 27, 28, 43, 67, 87, 88, 96, 97, 

15 99-1 01 , 133, 1 35 and 1 36, and/or the surface of ErbB2 defined by amino acids 
333, 354, 359-358, 361-366, 390, 392, 416, 417, 419, 420, 423, 425, 426, 446, 
448, 473 and 475. 

6. A method as claimed in claim 1 wherein the receptor is ErbB-3 and the 
20 topographic region of ErbB-3 is the ligand binding surface defined by amino 

acids 14-21, 23, 25, 29, 32, 33, 48, 72, 92, 93, 101, 102, 104-106, 129, 131 
and 132, and/or the ligand binding surface defined by amino acids 322, 343, 
345-347, 350-355, 379, 381, 405, 406, 408, 409, 412, 414, 415, 436, 438, 464 
and 466. 

25 

7. A method as claimed in claim 6 wherein the compound is selected or 
designed to have portions that match residues positioned on the ligand binding 
surface of ErbB-3 defined by amino acids 14-21, 23, 25, 29, 32, 33, 48, 72, 92, 
93, 101, 102, 104-106, 129, 131 and 132, and/or the ligand binding surface of 

30 ErbB-3 defined by amino acids 322, 343, 345-347, 350-355, 379, 381 , 405, 
406, 408, 409, 412, 414, 415, 436, 438, 464 and 466. 

8. A method as claimed in claim 1 wherein the receptor is ErbB-4 and the 
topographic region of ErbB-4 is the ligand binding surface defined by amino 

35 acids 13-20, 22, 24, 28, 31, 32, 47, 71, 91, 92, 100, 101, 103-105, 128, 130, 
131, and/or the ligand binding surface defined by amino acids 326, 347, 



WO 03/014159 



PCT/AU02/01042 



50 

349-351, 354-359, 383, 385, 409, 410, 411, 412, 415, 417, 418, 439, 441, 466 
and 468. 

9. A method as claimed in claim 8 wherein the compound is selected or 

5 designed to have portions that match residues positioned on the ligand binding 
surface of ErbB-4 defined by amino acids 13-20, 22, 24, 28, 31, 32, 47, 71 , 91, 
92, 100, 101, 103-105, 128, 130, 131, and/or the ligand binding surface of 
ErbB-4 defined by amino acids 326, 347, 349-351, 354-359, 383, 385, 409, 
410, 411, 412, 415, 417, 418, 439, 441, 466 and 468. 

10 

10. A method as claimed in claim 1 wherein the compound is selected or 
designed to interact with a site within 5 A of atomic positions of the EGF 
receptor listed in Appendices III or IV or corresponding regions of other 
members of the EGF receptor family, such that the compound interferes 

15 allosterically with the binding of a natural ligand to a member of the EGF 
receptor family. 

11 . A method of selecting or designing a compound that inhibits the 
formation of active dimers of receptors of the EGF receptor family, the method 

20 comprising: 

(a) assessing the stereochemical complementarity between the 
compound and a topographic region of the receptor, wherein the receptor 
comprises: 

(i) amino acids 1 -501 of the EGF receptor positioned at atomic 

25 coordinates as shown in Appendix I or Appendix II, or structural 

coordinates having a root mean square deviation from the 
backbone atoms of said amino acids of not more than 1 .5A; 

(ii) one or more subsets of said amino acids related to the coordinates 
shown in Appendix I or Appendix II by whole body translations 

30 and/or rotations; or 

(iii) amino acids present in the amino acid sequence of a receptor of 
the EGF receptor family, which form an equivalent three- 
dimensional structure to that of amino acids 1-501 of the EGF 
receptor positioned at atomic coordinates substantially as shown in 

35 Appendix I or Appendix II, or structural coordinates having a root 
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mean square deviation from the backbone atoms of said amino 
acids of not more than 1 .5A, or one or more subsets thereof, 
(b) obtaining a compound which possesses stereochemical 
complementarity to a topographic region of the receptor; and 
5 (c) testing the compound for its ability to inhibit the formation of active 

dimers of the receptors. 

1 2. A method as claimed in claim 1 1 wherein the receptor is EGFR and the 
topographic region of the EGFR to which the compound, or a portion thereof, 

10 has stereochemical complementarity is the dimer interface defined by amino 
acids 38, 86, 194, 195, 204, 205, 230, 239, 242-246, 248-253, 262-265, 275, 
278-280, 282-288 and 318 and/or the dimer interface defined by amino acids 
86, 193, 194, 204, 205, 229, 230, 239, 242, 244-246, 248-253, 262-265, 275, 
278-280 and 282-287. 

15 

13. A method as claimed in claim 12 wherein the compound is selected or 
designed to have portions that match residues positioned on the dimer 
interface of EGFR defined by amino acids 38, 86, 194, 195, 204, 205, 230, 239, 
242-246, 248-253, 262-265, 275, 278-280, 282-288 and 318 and/or the dimer 

20 interface defined by amino acids 86, 1 93, 1 94, 204, 205, 229, 230, 239, 242, 
244-246, 248-253, 262-265, 275, 278-280 and 282-287. 

14. A method as claimed in claim 1 1 wherein the receptor is ErbB-2 and the 
topographic region of the ErbB-2 to which the compound, or a portion thereof, 

25 has stereochemical complementarity is the dimer interface defined by amino 
acids 36, 84, 202, 203, 211, 212, 237, 246, 249-253, 255-260, 269-272, 282, 
285-287, 289-295 and 326 and/or the dimer interface defined by amino acids 
84, 201, 202, 211, 212, 236, 237, 246, 249, 251-253, 255-260, 269-272, 282, 
285-287 and 289-294. 

30 

15. A method as claimed in claim 1 4 wherein the compound is selected or 
designed to have portions that match residues positioned on the dimer 
interface of ErbB-2 defined by amino acids 36, 84, 202, 203, 211, 212, 237, 
246, 249-253, 255-260, 269-272, 282, 285-287, 289-295 and 326 and/or the 

35 dimer interface defined by amino acids 84, 201 , 202, 21 1 , 212, 236, 237, 246, 
249, 251-253, 255-260, 269-272, 282, 285-287 and 289-294. 
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16. A method as claimed in claim 1 1 wherein the receptor is ErbB-3 and the 
topographic region of the ErbB-3 to which the compound, or a portion thereof, 
has stereochemical complementarity is the dimer interface defined by amino 
5 acids 41 , 89, 1 94, 1 95, 204, 205, 230, 239, 242-246, 248-253, 262-265, 275, 
278-279, 281-287 and 317 and/or the dimer interface defined by amino acids 
89, 193, 194, 204, 205, 229, 230, 239, 242, 244-246, 248-253, 262-265, 275, 
278-279 and 281-286. 

10 17. A method as claimed in claim 16 wherein the compound is selected or 
designed to have portions that match residues positioned on the dimer 
. interface of ErbB-3 defined by amino acids 41 , 89, 194, 195, 204, 205, 230, 
239, 242-246, 248-253, 262-265, 275, 278-279, 281-287 and 317 and/or the 
dimer interface defined by amino acids 89, 193, 194, 204, 205, 229, 230, 239, 

15 242, 244-246, 248-253, 262-265, 275, 278-279 and 281 -286. 

1 8. A method as claimed in claim 1 1 wherein the receptor is ErbB-4 and the 
topographic region of the ErbB-4 to which the compound, or a portion thereof, 
has stereochemical complementarity is the dimer interface defined by amino 
20 acids 40, 88, 1 96, 1 97, 206, 207, 232, 241 , 244-248, 250-255, 264-267, 277, 
280-281 , 283-289 and 319 and/or the dimer interface defined by amino acids 
88, 195, 196, 206, 207, 231, 232, 241, 244, 246-248, 250-255, 264-267, 277, 
280-281 and 283-286. 

25 19. A method as claimed in claim 1 8 which further comprises selecting or 
designing a compound which has portions that match residues positioned on 
the dimer interface of ErbB-4 defined by amino acids 40, 88, 196, 197, 206, 
207, 232, 241, 244-248, 250-255, 264-267, 277, 280-281, 283-289 and 319 
and/or the dimer interface defined by amino acids 88, 195, 196, 206, 207, 231, 

30 232, 241 , 244, 246-248, 250-255, 264-267, 277, 280-281 and 283-286. 

20. A method as claimed in any one of claims 1 1 to 1 9 wherein the 
compound is designed or selected to comprise a first domain which interacts 
with the dimer interface of a first EGF receptor family member and a second 
35 domain which interacts with the dimer interface of a second EGF receptor 
family member. 
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21 . A computer-assisted method for identifying potential compounds able to 
interact with a member of the EGF receptor family and thereby modulate an 
activity mediated by receptor, using a programmed computer comprising a 
5 processor, an input device, and an output device, comprising the steps of: 

(a) inputting into the programmed computer, through the input device, 
data comprising the atomic coordinates of amino acids 1-501 of the 
EGF receptor molecule as shown in Appendix I or Appendix II, or 
structural coordinates having a root mean square deviation from the 

10 backbone atoms of said amino acids of not more than 1 ,5A, or one 

or more subsets of said amino acids, or one or more subsets of 
said amino acids related to the coordinates shown in Appendix I or 
Appendix II by whole body translations and/or rotations; 

(b) generating, using computer methods, a set of atomic coordinates of 
15 a structure that possesses stereochemical complementarity to a 

topographic region of the EGF receptor molecule, wherein the EGF 
receptor molecule is characterised by the atomic coordinates of 
amino acids 1-501 as shown in Appendix I or Appendix II, or 
structural coordinates having a root mean square deviation from the 

20 backbone atoms of said amino acids of not more than 1 .5A, or one 

or more subsets of said amino acids, or one or more subsets of 
said amino acids related to the coordinates shown in Appendix I or 
Appendix II by whole body translations and/or rotations, thereby 
generating a criteria data set; 

25 (c) comparing, using the processor, the criteria data set to a computer 

database of chemical structures; 

(d) selecting from the database, using computer methods, chemical 
structures which are similar to a portion of said criteria data set; and 

(e) outputting, to the output device, the selected chemical structures 
30 which are complementary to or similar to a portion of the criteria 

data set. 

22. A method as claimed in claim 21 wherein the receptor is EGFR and the 
topographic region of EGFR is the ligand binding surface defined by amino 
35 acids 11-18, 20, 22, 26, 29, 30, 45, 69, 89, 90, 98, 99, 101-103, 125, 127 and 
128, and/or the ligand binding surface defined by amino acids 325, 346, 348- 
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350, 353-358, 382, 384, 408, 409, 411, 412, 415, 417, 418, 438, 440, 465 and 
467. 

23. A method as claimed in claim 22 wherein the compound is selected or 

5 designed to have portions that match residues positioned on the ligand binding 
surface of EGFR defined by amino acids 1.1-18, 20, 22, 26, 29, 30, 45, 69, 89, 
90, 98, 99, 101-103, 125, 127 and 128, and/or the ligand binding surface of 
EGFR defined by amino acids 325, 346, 348-350, 353-358, 382, 384, 408, 409, 
411,41 2, 41 5, 41 7, 41 8, 438, 440, 465 and 467. 

10 

24. A method as claimed in claim 21 wherein the receptor is ErbB-2 and the 
topographic region of ErbB2 is the surface defined by amino acids 9-16, 18, 20, 

24. 27, 28,- 43, 67, 87, 88, 96, 97, 99-101, 133, 135 and 136, and/or the surface 
defined by amino acids 333, 354, 359-358, 361-366, 390, 392, 416, 417, 419, 

15 420, 423, 425, 426, 446, 448, 473 and 475. 

25. A method as claimed in claim 24 wherein the compound is selected or 
designed to have portions that match residues positioned on the surface of 
ErbB2 defined by amino acids 9-16, 18, 20, 24, 27, 28, 43, 67, 87, 88, 96, 97, 

20 99-1 01 , 1 33, 1 35 and 1 36, and/or the surface of ErbB2 defined by amino acids 
333, 354, 359-358, 361-366, 390, 392, 416, 417, 419, 420, 423, 425, 426, 446, 
448, 473 and 475. 

26. A method as claimed in claim 21 wherein the receptor is ErbB-3 and the 
25 topographic region of ErbB-3 is the ligand binding surface defined by amino 

acids 14-21, 23, 25, 29, 32, 33, 48, 72, 92, 93, 101, 102, 104-106, 129, 131 
and 132, and/or the ligand binding surface defined by amino acids 322, 343, 
345-347, 350-355, 379, 381, 405, 406, 408, 409, 412, 414, 415, 436, 438, 464 
and 466. 

30 

27. A method as claimed in claim 26 wherein the compound is selected or 
designed to have portions that match residues positioned on the ligand binding 
surface of ErbB-3 defined by amino acids 14-21, 23, 25, 29, 32, 33, 48, 72, 92, 
93, 101, 102, 104-106, 129, 131 and 132, and/or the ligand binding surface of 

35 ErbB-3 defined by amino acids 322, 343, 345-347, 350-355, 379, 381 , 405, 
406, 408, 409, 412, 414, 415, 436, 438, 464 and 466. 
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28. A method as claimed in claim 21 wherein the receptor is ErbB-4 and the 
topographic region of ErbB-4 is the ligand binding surface defined by amino 
acids 13-20, 22, 24, 28, 31, 32, 47, 71, 91, 92, 100, 101, 103-105, 128, 130, 

5 1 31 , and/or the ligand binding surface defined by amino acids 326, 347, 

349-351, 354-359, 383, 385, 409, 410, 411, 412, 415, 417, 418, 439, 441, 466 
and 468. 

29. A method as claimed in claim 28 wherein the compound is selected or 
10 designed to have portions that match residues positioned on the ligand binding 

surface of ErbB-4 defined by amino acids 1 3-20, 22, 24, 28, 31 , 32, 47, 71,91, 
92, 100, 101, 103-105, 128, 130, 131, and/or the ligand binding surface of 
ErbB-4 defined by amino acids 326, 347, 349-351, 354-359, 383, 385, 409, 
41 0, 41 1 , 41 2, 41 5, 41 7, 41 8, 439, 441 , 466 and 468. 

15 

30. A method as claimed in claim 21 wherein the receptor is EGFR and the 
topographic region of the EGFR to which the compound, or a portion thereof, 
has stereochemical complementarity is the dimer interface defined by amino 
acids 38, 86, 194, 195, 204, 205, 230, 239, 242-246, 248-253, 262-265, 275, 

20 278-280, 282-288 and 318 and/or the dimer interface defined by amino acids 
86, 193, 194, 204, 205, 229, 230, 239, 242, 244-246, 248-253, 262-265, 275, 
278-280 and 282-287. 

31 . A method as claimed in claim 30 wherein the compound is selected or 
25 designed to have portions that match residues positioned on the dimer 

interface of EGFR defined by amino acids 38, 86, 194, 195, 204, 205, 230, 239, 
242-246, 248-253, 262-265, 275, 278-280, 282-288 and 318 and/or the dimer 
interface defined by amino acids 86, 193, 194, 204, 205, 229, 230, 239, 242, 
244-246, 248-253, 262-265, 275, 278-280 and 282-287. 

30 

32. A method as claimed in claim 21 wherein the receptor is ErbB-2 and the 
topographic region of the ErbB-2 to which the compound, or a portion thereof, 
has stereochemical complementarity is the dimer interface defined by amino 
acids 36, 84, 202, 203, 21 1, 212, 237, 246, 249-253, 255-260, 269-272, 282, 

35 285-287, 289-295 and 326 and/or the dimer interface defined by amino acids 
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84, 201 , 202, 21 1 , 212, 236, 237, 246, 249, 251-253, 255-260, 269-272, 282, 
285-287 and 289-294. 

33. A method as claimed in claim 32 wherein the compound is selected or 
5 designed to have portions that match residues positioned on the dimer 

interface of ErbB-2 defined by amino acids 36, 84, 202, 203, 211, 212, 237, 
246, 249-253, 255-260, 269-272, 282, 285-287, 289-295 and 326 and/or the 
dimer interface defined by amino acids 84, 201 , 202, 21 1 , 212, 236, 237, 246, 
249, 251-253, 255-260, 269-272, 282, 285-287 and 289-294. 

10 

34. A method as claimed in claim 21 wherein the receptor is ErbB-3 and the 
topographic region of the ErbB-3 to which the compound, or a portion thereof, 
has stereochemical complementarity is the dimer interface defined by amino 
acids 41, 89, 194, 195, 204, 205, 230, 239, 242-246, 248-253, 262-265, 275, 

15 278-279, 281-287 and 31 7 and/or the dimer interface defined by amino acids 
89, 193, 194, 204, 205, 229, 230, 239, 242, 244-246, 248-253, 262-265, 275, 
278-279 and 281-286. 

35. A method as claimed in claim 34 wherein the compound is selected or 
20 designed to have portions that match residues positioned on the dimer 

interface of ErbB-3 defined by amino acids 41, 89, 194, 195, 204, 205, 230, 
239, 242-246, 248-253, 262-265, 275, 278-279, 281-287 and 317 and/or the 
dimer interface defined by amino acids 89, 193, 194, 204, 205, 229, 230, 239, 
242, 244-246, 248-253, 262-265, 275, 278-279 and 281-286. 

25 

36. A method as claimed in claim 21 wherein the receptor is ErbB-4 and the 
topographic region of the ErbB-4 to which the compound, or a portion thereof, 
has stereochemical complementarity is the dimer interface defined by amino 
acids 40, 88, 196, 197, 206, 207, 232, 241, 244-248, 250-255, 264-267, 277, 

30 280-281 , 283-289 and 31 9 and/or the dimer interface defined by amino acids 
88, 195, 196, 206, 207, 231, 232, 241, 244, 246-248, 250-255, 264-267, 277, 
280-281 and 283-286. 

37. A method as claimed in claim 36 which further comprises selecting or 
35 designing a compound which has portions that match residues positioned on 

the dimer interface of ErbB-4 defined by amino acids 40, 88, 196, 197, 206, 
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207, 232, 241, 244-248, 250-255, 264-267, 277, 280-281 , 283-289 and 319 
and/or the dimer interface defined by amino acids 88, 195, 196, 206, 207, 231, 
232, 241, 244, 246-248, 250-255, 264-267, 277, 280-281 and 283-286. 

5 38. A method as claimed in any one of claims 21 to 37 which further 

comprises the step of obtaining a compound with a chemical structure selected 
in steps (d) and (e), and testing the compound for the ability to decrease an 
activity mediated by the receptor. 

10 39. A method as claimed in claim 38 wherein the test is carried out in vitro. 

40. A method as claimed in claim 39 wherein the in vitro test is a high 
throughput assay. 

15 41 . A method as claimed in claim 38 wherein the test is carried out in vivo. 

42. A method as claimed in any one of claims 1 to 41 wherein the 
stereochemical complementarity between the compound and the receptor is 

20 such that the compound has a Kd for the receptor site of less than 1 0" 6 M. 

43. A method as claimed in any one of claims 1 to 41 wherein the 
stereochemical complementarity between the compound and the receptor is 
such that the compound has a Kd for the receptor site of less than 10" 8 M 

25 

44. A method as claimed in any one of claims 1 to 41 wherein the 
stereochemical complementarity between the compound and the receptor is 
such that the compound has a Kd for the receptor site of less than 10- 9 M. 

30 45. A method as claimed in any one of claims 1 to 44 wherein the compound 
is selected or modified from a known compound identified from a data base. 

46. A method as claimed in any one of claims 1 to 45 wherein the method is 
used to identify potential compounds which have the ability to decrease an 
35 activity mediated by the receptor. 
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47. A computer for producing a three-dimensional representation of a 
molecule or molecular complex, wherein the computer comprises: 

(a) a machine-readable data storage medium comprising a data 
storage material encoded with machine-readable data, wherein the 
machine readable data comprise the atomic coordinates of amino 
acids 1-501 of the EGF receptor molecule as shown in Appendix I 
or Appendix II, or structural coordinates having a root mean square 
deviation from the backbone atoms of said amino acids of not more 
than 1 .5A, or one or more subsets of said amino acids, or one or 
more subsets of said amino acids related to the coordinates shown 
in Appendix I or Appendix II by whole body translations and/or 
rotations; 

(b) a working memory for storing instructions for processing the 
machine-readable data; 

(c) a central-processing unit coupled to the working memory and to the 
machine-readable data storage medium, for processing the 
machine-readable data into the three dimensional representation; 
and 

(d) an output hardware coupled to the central processing unit, for 
receiving the three-dimensional representation. 

48. A computer as claimed in claim 47 wherein the subset of amino acids are 
the amino acids (i) defining either or both the ligand binding surface(s), or (ii) 
defining dimerization interface. 

49. A compound able to interact with a member of the EGF receptor family 
and to modulate an activity mediated by the receptor, the compound being 
obtained by a method according to any one of claims 1 to 46. 

50. A compound as claimed in claim 49 which is a mutant of the natural 
ligand of a receptor of the EGF receptor family, where at least one mutation 
occurs in the region of the natural ligand which interacts with the receptor. 

51 . A pharmaceutical composition for preventing or treating a disease 
associated with signaling by a molecule of the EGF receptor family which 
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comprises a compound according to claim 49 or claim 50 and a 
pharmaceutical^ acceptable carrier or diluent. 

52. A method of preventing or treating a disease associated with signaling by 
5 a molecule of the EGF receptor family which method comprises administering 
to a subject in need thereof a compound identified by a method comprising the 
step of the step of assessing the stereochemical complementarity between the 
compound and a topographic region of the receptor, wherein the receptor is 
characterised by> 

10 (i) amino acids 1 -501 of the EGF receptor positioned at atomic 

coordinates as shown in Appendix I or Appendix II, or structural 
coordinates having a root mean square deviation from the 
backbone atoms of said amino acids of not more than 1 .5A; 

(ii) one or more subsets of said amino acids related to the coordinates 
15 shown in Appendix I or Appendix II by whole body translations 

and/or rotations; or 

(iii) amino acids present in the amino acid sequence of a member of 
the EGF receptor family, which form an equivalent three- 
dimensional structure to that of amino acids 1-501 of the EGF 

20 receptor positioned at atomic coordinates substantially as shown in 

Appendix I or Appendix II, or structural coordinates having a root 
mean square deviation from the backbone atoms of said amino 
acids of not more than 1 .5A, or a subset thereof. 

25 53. A method as claimed in claim 52 wherein the receptor is EGFR and the 
topographic region of EGFR is the ligand binding surface defined by amino 
acids 11-18, 20, 22, 26, 29, 30, 45, 69, 89, 90, 98, 99, 101-103, 125, 127 and 
128, and/or the ligand binding surface defined by amino acids 325, 346, 348- 
350, 353-358, 382, 384, 408, 409, 41 1, 412, 415, 417, 418, 438, 440, 465 and 

30 467. 

54. A method as claimed in claim 53 wherein the compound is selected or 
designed to have portions that match residues positioned on the ligand binding 
surface of EGFR defined by amino acids 11-18, 20, 22, 26, 29, 30, 45, 69, 89, 
35 90, 98, 99, 101-103, 125, 127 and 128, and/or the ligand binding surface of 
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EGFR defined by amino acids 325, 346, 348-350, 353-358, 382, 384, 408, 409, 
41 1, 412, 415, 417, 418, 438, 440, 465 and 467. 

55. A method as claimed in claim 52 wherein the receptor is ErbB-2 and the 
5 topographic region of ErbB2 is the surface defined by amino acids 9-16, 18, 
20, 24, 27, 28, 43, 67, 87, 88, 96, 97, 99-101, 133, 135 and 136, and/or the 
surface defined by amino acids 333, 354, 359-358, 361-366, 390, 392, 416, 
417, 419, 420, 423, 425, 426, 446, 448, 473 and 475. 

10 56. A method as claimed in claim 55 wherein the compound is selected or 
designed to have portions that match residues positioned on the surface of 
ErbB2 defined by amino acids 9-16, 18, 20, 24, 27, 28, 43, 67, 87, 88, 96, 97, 
99-101, 133, 135 and 136, and/or the surface of ErbB2 defined by amino acids 
333, 354, 359-358, 361-366, 390, 392, 416, 417, 419, 420, 423, 425, 426, 446, 

15 448, 473 and 475. 

57. A method as claimed in claim 52 wherein the receptor is ErbB-3 and the 
topographic region of ErbB-3 is the ligand binding surface defined by amino 
acids 14-21, 23, 25, 29, 32, 33, 48, 72, 92, 93, 101, 102, 104-106, 129, 131 

20 and 132, and/or the ligand binding surface defined by amino acids 322, 343, - 
345-347, 350-355, 379, 381, 405, 406, 408, 409, 412, 414, 415, 436, 438, 464 
and 466. 

58. A method as claimed in claim 57 wherein the compound is selected or 
25 designed to have portions that match residues positioned on the ligand binding 

surface of ErbB-3 defined by amino acids 14-21, 23, 25, 29, 32, 33, 48, 72, 92, 
93, 101, 102, 104-106, 129, 131 and 132, and/or the ligand binding surface of 
ErbB-3 defined by amino acids 322, 343, 345-347, 350-355, 379, 381, 405, 
406, 408, 409, 412, 414, 415, 436, 438, 464 and 466. 

30 

59. A method as claimed in claim 52 wherein the receptor is ErbB-4 and the 
topographic region of ErbB-4 is the ligand binding surface defined by amino 
acids 13-20, 22, 24, 28, 31, 32, 47, 71, 91, 92, 100, 101, 103-105, 128, 130, 
131, and/or the ligand binding surface defined by amino acids 326, 347, 

35 349-351 , 354-359, 383, 385, 409, 410, 411, 412, 415, 417, 418, 439, 441, 466 
and 468. 
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60. A method as claimed in claim 59 wherein the compound is selected or 
designed to have portions that match residues positioned on the ligand binding 
surface of ErbB-4 defined by amino acids 1 3-20, 22, 24, 28, 31 , 32, 47, 71, 91, 

5 92, 100, 101, 103-105, 128, 130, 131, and/or the ligand binding surface of 
ErbB-4 defined by amino acids 326, 347, 349-351 , 354-359, 383, 385, 409, 
410, 411, 412, 415, 417, 418, 439, 441, 466 and 468. 

61 . A method as claimed in claim 52 wherein the compound is selected or 
10 designed to interact with a site within 5 A of atomic positions of the EGF 

receptor listed in Appendices III or IV or corresponding regions of other 
members of the EGF receptor family, such that the compound interferes 
allosterically with the binding of a natural ligand to a member of the EGF 
receptor family. 

15 

62. A method as claimed in claim 52 wherein the receptor is EGFR and the 
topographic region of the EGFR to which the compound, or a portion thereof, 
has stereochemical complementarity is the dimer interface defined by amino 
acids 38, 86, 194, 195, 204, 205, 230, 239, 242-246, 248-253, 262-265, 275, 

20 278-280, 282-288 and 31 8 and/or the dimer interface defined by amino acids 
86, 193, 194, 204, 205, 229, 230, 239, 242, 244-246, 248-253, 262-265, 275, 
278-280 and 282-287. 

63. A method as claimed in claim 62 wherein the compound is selected or 
25 designed to have portions that match residues positioned on the dimer 

interface of EGFR defined by amino acids 38, 86, 194, 195, 204, 205, 230, 239, 
242-246, 248-253, 262-265, 275, 278-280, 282-288 and 318 and/or the dimer 
interface defined by amino acids 86, 193, 194, 204, 205, 229, 230, 239, 242, 
244-246, 248-253, 262-265, 275, 278-280 and 282-287 . 

30 

64. A method as claimed in claim 52 wherein the receptor is ErbB-2 and the 
topographic region of the ErbB-2 to which the compound, or a portion thereof, 
has stereochemical complementarity is the dimer interface defined by amino 
acids 36, 84, 202, 203, 21 1,212, 237, 246, 249-253, 255-260, 269-272, 282, 

35 285-287, 289-295 and 326 and/or the dimer interface defined by amino acids 
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84, 201, 202, 211, 212, 236, 237, 246, 249, 251-253, 255-260, 269-272, 282, 
285-287 and 289-294. 

65. A method as claimed in claim 64 which further comprises selecting or 
5 designing a compound which has portions that match residues positioned on 

the dimer interface of ErbB-2 defined by amino acids 36, 84, 202, 203, 21 1 , 
212, 237, 246, 249-253, 255-260, 269-272, 282, 285-287, 289-295 and 326 
and/or the dimer interface defined by amino acids 84, 201, 202, 211, 212, 236, 
237, 246, 249, 251-253, 255-260, 269-272, 282, 285-287 and 289-294. 

10 

66. A method as claimed in claim 52 wherein the receptor is ErbB-3 and the 
topographic region of the ErbB-3 to which the compound, or a portion thereof, 
has stereochemical complementarity is the dimer interface defined by amino 
acids 41, 89, 194, 195, 204, 205, 230, 239, 242-246, 248-253, 262-265, 275, 

15 278-279, 281-287 and 317 and/or the dimer interface defined by amino acids 
89, 193, 194, 204, 205, 229, 230, 239, 242, 244-246, 248-253, 262-265, 275, 
278-279 and 281-286. 

67. A method as claimed in claim 66 which further comprises selecting or 
20 designing a compound which has portions that match residues positioned on 

the dimer interface of ErbB-3 defined by amino acids 41, 89, 194, 195, 204, 
205, 230, 239, 242-246, 248-253, 262-265, 275, 278-279, 281-287 and 317 
and/or the dimer interface defined by amino acids 89, 193, 194, 204, 205, 229, 
230, 239, 242, 244-246, 248-253, 262-265, 275, 278-279 and 281-286. 

68. A method as claimed in claim 52 wherein the receptor is ErbB-4 and the 
topographic region of the ErbB-4 to which the compound, or a portion thereof, 
has stereochemical complementarity is the dimer interface defined by amino 
acids 40, 88, 196, 197, 206, 207, 232, 241, 244-248, 250-255, 264-267, 277, 
280-281 , 283-289 and 319 and/or the dimer interface defined by amino acids 
88, 195, 196, 206, 207, 231, 232, 241, 244, 246-248, 250-255, 264-267, 277, 
280-281 and 283-286. 

69. A method as claimed in claim 68 which further comprises selecting or 
designing a compound which has portions that match residues positioned on 
the dimer interface of ErbB-4 defined by amino acids 40, 88, 196, 197, 206, 
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207, 232, 241, 244-248, 250-255, 264-267, 277, 280-281, 283-289 and 319 
and/or the dimer interface defined by amino acids 88, 195, 196, 206, 207, 231 , 
232, 241, 244, 246-248, 250-255, 264-267, 277, 280-281 and 283-286. 

5 70. A method as claimed in any one of claims 62 to 69 wherein the 

compound is designed or selected to comprise a first domain which interacts 
with the dimer interface of a first EGF receptor family member and a second 
domain which interacts with the dimer interface of a second EGF receptor 
family member. 

10 

71 . A method as claimed in any one of claims 52 to 70 wherein the 
stereochemical complementarity between the compound and the receptor is 
such that the compound has a Kd for the receptor site of less than 10" 6 M. 

15 72. A method as claimed in any one of claims 52 to 71 wherein the 

stereochemical complementarity between the compound and the receptor is 
such that the compound has a Kd for the receptor site of less than 10" 8 M 

73. A method as claimed in any one of claims 52 to 72 wherein the 

20 stereochemical complementarity between the compound and the receptor is 
such that the compound has a Kd for the receptor site of less than 10* 9 M. 

74. A method as claimed in any one of claims 52 to 73 wherein the 
compound is selected or modified from a known compound identified from a 

25 data base. 

75. A method as claimed in any one of claims 52 to 74 wherein the disease 
is selected from the group consisting of psoriasis and tumour states. 

30 76. A method as claimed in claim 75 wherein the tumour state is selected 
from the group consisting of cancer of the breast, brain, colon, prostate, ovary, 
cervix, pancreas, lung, head and neck, and melanoma, rhabdomyosarcoma, 
mesothelioma, squamous carcinomas of the skin and glioblastoma. 

35 77. A method for evaluating the ability of a chemical entity to bind to EGFR, 
said method comprising the steps of: 
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5 



(a) creating a computer model of at least one region of EGFR using 
structure coordinates wherein the root mean square deviation 
between said structure coordinates and the structure coordinates of 
amino acids 1 -501 of EGFR as set forth in Appendix I or Appendix 
II is not more than about 1 .5 A; 



(b) employing computational means to perform a fitting operation 
between the chemical entity and said computer model of the 
binding surface; and 



10 



(c) analysing the results of said fitting operation to quantify the 

association between the chemical entity and the binding surface 
model. 



78. A method of utilizing molecular replacement to obtain structural 
information about a molecule or a molecular complex of unknown structure, 
15 comprising the steps of: 



79. A crystalline composition comprising amino acids 1-501 of the EGF 
25 receptor or a portion thereof. 

80. A method of assessing the interaction between a compound and the 
EGF receptor, the method comprising exposing a crystalline composition 
comprising amino acids 1-501 of the EGF receptor or a portion thereof to the 

30 compound and measuring the level of binding to the crystal. 

81 . A polypeptide complex in a crystallized form comprising the amino acids 
1-501 of EGFR and TGFa. 

35 82. A variant of a ligand of the EGF receptor family in which the sequence of 
the ligand is modified such that the ability to interact with the L1 domain of the 



20 



- (0 
(ii) 



(iii) 



crystallising said molecule or molecular complex; 
generating an X-ray diffraction pattern from said crystallized 
molecule or molecular complex; 

applying at least a portion of the structure coordinates set forth in 
Appendix I or Appendix II to the X-ray diffraction pattern to generate 
a three-dimensional electron density map of at least a portion of the 
molecule or molecular complex whose structure is unknown. 
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member of the EGF receptor family is retained or increased and the ability to 
interact with the L2 domain of the member of the EGF receptor family is 
removed or decreased, or vice versa. 

5 83. A variant of a ligand of the EGF receptor family in which the sequence of 
the ligand is modified such that the ability to interact with the L1 domain of a 
member of the EGF receptor family is retained or increased and the ability to 
interact with the L2 domain of a member of the EGF receptor family is retained 
or increased, with the proviso that the binding to at least one of L1 or L2 is 

10 increased. 

84 A variant as claimed in claim 82 or claim 83 in which the ligand is 
selected from the group consisting of EGF, TGF-a, amphiregulin, HB-EGF, 
betacellulin, epiregulin, epigen, NRG1a, NRG1p\ NRG2a, NRGp, NRG3 and 
15 NRG4. 

85. A variant as claimed in claim 84 wherein the ligand is TGFa. 

86. A TGFa variant as claimed in claim 85 wherein the TGFa is modified at 
20 one more amino acids selected from the group consisting of amino acids 3-5, 8, 

9, 1 1-15, 17, 18, 22, 24, 26, 27, 29-34, 36 and 38-50. 

87. An extracellular fragment of EGFR, wherein the fragment is modified at 
one or more amino acids selected from the group consisting of 

25 (i) amino acids 11-18, 20, 22, 26, 29, 30, 45, 69, 89, 90, 98, 99, 101 1 03, 

125, 127, 128, 325, 346, 348-350, 353-358, 382, 384, 408, 409, 411, 
412, 415, 417, 418, 438, 440, 465 and 467, or 

(ii) amino acids 38, 86, 193-195, 204, 205, 229, 230, 239, 242-246, 248- 
253, 262-265, 275, 278-280, 282-288 and 318, 
30 wherein the modification increases the affinity of the fragment for one or more 
of its natural ligands when compared to the unmodified fragment. 

88. An extracellular fragment of ErbB-2, wherein the fragment is modified at 
one or more amino acids selected from the group consisting of 
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(i) amino acids 9-16, 18, 20, 24, 27, 28, 43, 67, 87, 88, 96, 97, 99-101, 
133, 135, 136, 333, 354, 359-358, 361-366, 390, 392, 416, 417, 419, 
420, 423, 425, 426, 446, 448, 473 and 475, or 

(ii) amino acids 36, 84, 201-203, 211, 212, 236, 237, 246, 249-253, 255- 
5 260, 269-272, 282, 285-287, 289-295 and 326. 

wherein the modification increases the affinity of the fragment for one or more 
of its natural ligands when compared to the unmodified fragment. 

89. An extracellular fragment of ErbB-3, wherein the fragment is modified at 
10 one or more amino acids selected from the group consisting of 

(i) amino acids 14-21, 23, 25, 29, 32, 33, 48, 72, 92, 93, 101, 102, 104- 
106, 129, 131, 132, 322, 343, 345-347, 350-355, 379, 381, 405, 406, 
408, 409, 412, 414, 415, 436, 438, 464 and 466, or 

(ii) amino acids 41, 89, 193-195, 204, 205, 229, 230, 239, 242-246, 248- 
15 253, 262-265, 275, 278-279, 281 -287, 31 7. 

wherein the modification increases the affinity of the fragment for one or more 
of its natural ligands when compared to the unmodified fragment. 

90. An extracellular fragment of ErbB-4, wherein the fragment is modified at 
20 one or more amino acids selected from the group consisting of 

(i) amino acids 13-20, 22, 24, 28, 31, 32, 47, 71, 91, 92, 100, 101, 103- 
105, 128, 130, 131, 326, 347, 349-351, 354-359, 383, 385, 409, 410, 
41 1 , 41 2, 41 5, 41 7, 41 8, 439, 441 , 466 and 468, or 

(ii) amino acids 40, 88, 195-197, 206, 207, 231, 232, 241, 244-248, 250- 
25 255, 264-267, 277, 280-281 , 283-289 and 319. 

wherein the modification increases the affinity of the fragment for one or more 
of its natural ligands when compared to the unmodified fragment. 

91 . An extracellular fragment of EGFR wherein the fragment is modified at 
30 one or more amino acids of EGFR selected from the group consisting of: 

(i) amino acids 5, 6, 8-10, 19, 21-25, 28, 32, 33, 38, 39, 40, 42, 44, 47, 
48, 50, 63, 64, 66, 68, 71, 73, 87, 88, 91-94, 96, 104-107, 109, 123, 130, 
131, 151-160, 315-324, 326, 328, 329, 331, 332, 343, 344, 351, 359- 
363, 379, 380, 385, 387, 388, 394, 404-407, 410, 413, 420, 434-436, 
35 440, 441 , 443, 448, 449, 461 -464, 466-468; or 
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(ii) amino acids 1-6, 8,9, 1 1 , 30, 35, 36, 39, 40, 60, 62-64, 82, 84, 85, 87- 
89, 94, 118, 120-122, 148, 187-193, 196-198, 200-203, 209-211, 213, 
215, 217-221, 231-233, 235, 237, 238, 241, 243, 244, 247, 254-261, 
266, 268-270, 272-274, 276, 277, 281, 289-297, 299-301, 303, 304, 311, 
5 31 2, 31 4-31 7, 31 9-323, 335, 340, 342-344, 346, 376, 378-380, 403-412, 

434, 459, 

wherein the modification increases the affinity of the fragment for one or more 
of its natural ligands when compared to the unmodified fragment. 

10 92. An extracellular fragment of ErbB-2 wherein the fragment is modified at 
one or more amino acids of ErbB-2 selected from the group consisting of: 

(i) amino acids 3,4, 6-8, 17, 19-23, 26, 30, 31, 36, 37, 38, 40, 42, 45, 46, 
48, 61, 62, 64, 66, 69, 71, 85, 86, 89-92, 94, 102-115, 117, 131, 138, 
139, 159-168, 323-323, 334, 336, 337, 339, 340, 351, 352, 359, 367- 

15 371 , 387, 388, 393, 395, 397, 402, 412-415, 418, 421 , 428, 442-444, 

448, 449, 451, 456, 457, 469-472, and 472-476, or 

(ii) amino acids 1-4, 6, 7, 9, 28, 33, 34, 37, 38, 58, 60-62, 80, 82, 83, 85- 
87, 92, 126, 128-130, 156, 195-201, 204-206, 208-211, 217-219, 221, 
223, 225-229, 239-241, 243, 245, 246, 249, 251, 252, 255, 262-269, 

20 274, 276-278, 280-282, 284, 285, 289, 297-305, 307-309, 31 1 , 312, 319, 

320, 322-325, 327-231, 343, 348, 350-352, 354, 384, 386-388, 411-420, 
442, and 467, 

wherein the modification increases the affinity of the fragment for one or more 
of its natural ligands when compared to the unmodified fragment. 

25 

93. An extracellular fragment of ErbB-3 wherein the fragment is modified at 
one or more amino acids of ErbB-3 selected from the group consisting of: 

(i) amino acids 8, 9, 11-13, 22, 24-28, 31, 35, 36, 41, 42, 43, 45, 47, 50, 
51, 53, 66, 67, 69, 71, 74, 76, 90, 91, 94-97, 99, 107-111, 113, 127, 134, 

30 135, 154-159, 314-321, 323, 325, 326, 328, 329, 340, 341, 348, 356- 

360, 376, 373, 382, 384, 385, 391, 401-404, 407, 410, 418, 432-434, 
438, 439, 441, 446, 447, 459-462, and 464-466, or 

(ii) amino acids 4-9, 11, 12, 14, 33, 38, 39, 42, 43, 63, 65-67, 85, 87, 88, 
90-92, 97, 122, 124-126, 152, 187-193, 196-198, 200-203, 209-211, 213, 

35 215, 217-221, 231-233, 235, 237, 238, 241, 243, 244, 247, 254-261, 

266, 268-270, 272-274, 276, 277, 280, 288-296, 298-300, 302, 303, 310, 
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311, 313-316, 318-320, 332, 337, 339-341, 343, 373, 375-377, 400-409, 
432 and 457, 

wherein the modification increases the affinity of the fragment for one or more 
of its natural ligands when compared to the unmodified fragment. 

5 

94. An extracellular fragment of ErbB-4 wherein the fragment is modified at 
one or more amino acids of ErbB-4 selected from the group consisting of: 

(i) amino acids 7, 8, 10-12, 21, 23-27, 30, 34, 35, 40, 41, 42, 44, 46, 49, 
50, 52, 65, 66, 68, 70, 73, 75, 89, 90, 93-96, 98, 106-110, 112, 126, 133, 

10 134, 154-163, 316-325, 327, 329, 330, 332, 333, 344, 345, 352, 360- 

364, 380, 381, 386, 388, 389, 395, 405-408, 413, 421, 435-437, 441, 
442, 444, 449, 450, 462-465 and 467-469 or 

(ii) amino acids 3-8, 10, 11 13, 32, 37, 38, 41, 42, 62, 64-66, 84, 86, 87, 
89-91, 96, 121, 123-125, 151, 189-195, 198-200, 202-205, 207-213, 215, 

15 21 7, 21 9-223, 233-235, 237, 239, 240, 243, 245, 246, 249, 256-263, 

268, 270-272, 274-276, 278, 279, 282, 290-298, 300-302, 304, 305, 312, 
313, 315-318, 320-324, 336, 341, 343-345, 347, 377,. 379-381, 404-412, 
435 and 460 

wherein the modification increases the affinity of the fragment for one or more 
20 of its natural ligands when compared to the unmodified fragment. 

95. A compound comprising fragment 1 -501 of EGFR or an equivalent 
fragment of a member of the EGF receptor family, wherein the fragment is 
modified to induce dimerisation of the fragment in back-to-back configuration. 

25 

96. A compound as claimed in claim 95 wherein the modification is made to 
a residue of the fragment which forms part of the back-to-back dimer interface. 

97. A compound as claimed in claim 96 wherein the modification involves 
30 substitution of at least one residue which forms part of the back to back dimer 

with a cysteine residue. 

98. A compound comprising fragment 1 -501 of EGFR wherein the fragment 
comprises the substitution P248C and/or A265C. 

35 
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99. A compound comprising fragment 1 -501 of EGFR wherein the fragment 
comprises the substitution D279C. 

1 00. A compound as claimed in claim 95 wherein the modification involves 
5 insertion of a dimerization sequence into the fragment. 

101. A compound as claimed in claim 1 00 wherein the dimerization sequence 
is inserted between residues 194 and 195 or between residues 204 and 205 of 
EGFR or equivalent residues of another member of the EGF receptor family. 

10 

1 02. A compound as claimed in any one of claims 95 to 1 01 wherein the 
fragment is conjugated to a molecule. 

1 03. A compound as claimed in claim 1 02 wherein the molecule is a constant 
15 domain of an immunoglobulin molecule. 

104. An antibody which binds to EGFR, the antibody being directed against (i) 
EGFR residues 100-108, 315-327 or 353-362; or (ii) EGFR residues 190-207, 
240-305 or parts thereof. 

20 

105. An antibody which binds to ErbB-2, the antibody being directed against 
(i) ErbB-2 residues 98-1 16, 323-335 or 361-374; or (ii) ErbB-2 residues 198- 
214, 247-313 or parts thereof. 

25 1 06. An antibody which binds to ErbB-3, the antibody being directed against 
(i) ErbB-3 residues 103-112, 314-324 or 350-363; or (ii) ErbB-3 residues 190- 
207, 240-304 or parts thereof. 

107. An antibody which binds to ErbB-4, the antibody being directed against 
30 (i) ErbB-4 residues 102-111, 316-328, 354-367; or (ii) ErbB-4 residues 192- 
209, 242-306 or parts thereof. 
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Figure 3 
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Figure 4A 
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Figure 4B 
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Figure 5 
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Figure 6 
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<210> 1 

<211> 501 

<212> PRT 

<213> Homo sapiens 

<400> 1 

Leu Glu Glu Lys Lys Val Cys Gin Gly Thr Ser Asn Lys Leu Thr Gin 
1 5 ** 10 15 



Leu Gly Thr Phe Glu Asp His Phe Leu Ser Leu Gin Arg Met Phe Asn 
20 25 30 



Asn Cys Glu Val Val Leu Gly Asn Leu Glu He Thr Tyr Val Gin Arg 
35 40 45 



Asn Tyr Asp Leu Ser Phe Leu Lys Thr He Gin Glu Val Ala Gly Tyr 
50 55 60 



Val Leu He Ala Leu Asn Thr Val Glu Arg He Pro Leu Glu Asn Leu 
65 70 75 80 



Gin He He Arg Gly Asn Met Tyr Tyr Glu Asn Ser Tyr Ala Leu Ala 
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85 



90 



95 



Val Leu Ser Asn Tyr Asp Ala Asn Lys Thr Gly Leu Lys Glu Leu Pro 
100 105 110 



Met Arg Asn Leu Gin Glu lie Leu His Gly Ala Val Arg Phe Ser Asn 
115 120 125 



Asn Pro Ala Leu Cys Asn Val Glu Ser lie Gin Trp Arg Asp lie Val 
130 "* 135 140 



Ser Ser Asp Phe Leu Ser Asn Met Ser Met Asp Phe Gin Asn His Leu 
145 150 155 160 



Gly Ser Cys Gin Lys Cys Asp Pro Ser Cys Pro Asn Gly Ser Cys Trp 
165 170 175 



Gly Ala Gly Glu Glu Asn Cys Gin Lys Leu Thr Lys lie lie Cys Ala 
180 185 190 



Gin Gin Cys Ser Gly Arg Cys Arg Gly Lys Ser Pro Ser Asp Cys Cys 
195 200 205 



His Asn Gin Cys Ala Ala Gly Cys Thr Gly Pro Arg Glu Ser Asp Cys 
210 215 220 



Leu Val Cys Arg Lys Phe Arg Asp Glu Ala Thr Cys Lys Asp Thr Cys 
225 230 235 240 



Pro Pro Leu Met Leu Tyr Asn Pro Thr Thr Tyr Gin Met Asp Val Asn 
245 ^ 250 255 



Pro Glu Gly Lys Tyr Ser Phe Gly Ala Thr Cys Val Lys Lys Cys Pro 
260 265 270 



Arg Asn Tyr Val Val Thr Asp His Gly Ser Cys Val Arg Ala Cys Gly 
275 280 285 



Ala Asp Ser Tyr Glu Met Glu Glu Asp Gly Val Arg Lys Cys Lys Lys 
290 295 300 



Cys Glu Gly Pro Cys Arg Lys Val Cys Asn Gly lie Gly lie Gly Glu 
305 "* ' 310 ^ 315 320 



Phe Lys Asp Ser Leu Ser He Asn Ala Thr Asn He Lys His Phe Lys 
325 330 335 
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Asn Cys Thr Ser lie Ser Gly Asp Leu His lie Leu Pro Val Ala Phe 
340 345 350 



Arg Gly Asp Ser Phe Thr His Thr Pro Pro Leu Asp Pro Gin Glu Leu 
355 360 365 



Asp lie Leu Lys Thr Val Lys Glu lie Thr Gly Phe Leu Leu lie Gin 
370 ' 375 ~ 380 



Ala Trp Pro Glu Asn Arg Thr Asp Leu His Ala Phe Glu Asn Leu Glu 
385 ^ 390 395 400 



lie He Arg Gly Arg Thr Lys Gin His Gly Gin Phe Ser Leu Ala Val 
405 410 415 



Val Ser Leu Asn He Thr Ser Leu Gly Leu Arg Ser Leu Lys Glu He 
420 425 430 



Ser Asp Gly Asp Val He He Ser Gly Asn Lys Asn Leu Cys Tyr Ala 
435 440 445 



Asn Thr He Asn Trp Lys Lys Leu Phe Gly Thr Ser Gly Gin Lys Thr 
450 455 460 



Lys He He Ser Asn Arg Gly Glu Asn Ser Cys Lys Ala Thr Gly Gin 
465 470 475 480 



Val Cys His Ala Leu Cys Ser Pro Glu Gly Cys Trp Gly Pro Glu Pro 
485 490 495 



Arg Asp Cys Val Ser 
500 



<210> 2 

<211> 509 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Ser Thr Gin Val Cys Thr Gly Thr Asp Met Lys Leu Arg Leu Pro Ala 
15 10 15 



Ser Pro Glu Thr His Leu Asp Met Leu Arg His Leu Tyr Gin Gly Cys 
20 25 30 
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Gin Val Val Gin Gly Asn Leu Glu Leu Thr Tyr Leu Pro Thr Asn Ala 
35 40 45 



Ser Leu Ser Phe Leu Gin Asp He Gin Glu Val Gin Gly Tyr Val Leu 
50 55 60 



He Ala His Asn Gin Val Arg Gin Val Pro Leu Gin Arg Leu Arg He 
65 70 75 80 



Val Arg Gly Thr Gin Leu Phe Glu Asp Asn Tyr Ala Leu Ala Val Leu 
85 90 95 



Asp Asn Gly Asp Pro Leu Asn Asn Thr Thr Pro Val Thr Gly Ala Ser 
100 105 110 



Pro Gly Gly Leu Arg Glu Leu Gin Leu Arg Ser Leu Thr Glu He Leu 
115 J 120 125 



Lys Gly Gly Val Leu He Gin Arg Asn Pro Gin Leu Cys Tyr Gin Asp 
130 ~ 135 140 



Thr He Leu Trp Lys Asp He Phe His Lys Asn Asn Gin Leu Ala Leu 
145 ^ 150 155 160 



Thr Leu He Asp Thr Asn Arg Ser Arg Ala Cys His Pro Cys Ser Pro 
165 • 170 175 



Met Cys Lys Gly Ser Arg Cys Trp Gly Glu Ser Ser Glu Asp Cys Gin 
18 0 185 190 



Ser Leu Thr Arg Thr Val Cys Ala Gly Gly Cys Ala Arg Cys Lys Gly 
195 200 205 



Pro Leu Pro Thr Asp Cys Cys His Glu Gin Cys Ala Ala Gly Cys Thr 
210 ~ 215 220 



Gly Pro Lys His Ser Asp Cys Leu Ala Cys Leu His Phe Asn His Ser 
225 "* 230 235 240 



Gly He Cys Glu Leu His Cys Pro Ala Leu Val Thr Tyr Asn Thr Asp 
245 250 255 



Thr Phe Glu Ser Met Pro Asn Pro Glu Gly Arg Tyr Thr Phe Gly Ala 
260 265 270 



Ser Cys Val Thr Ala Cys Pro Tyr Asn Tyr Leu Ser Thr Asp Val Gly 
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275 



280 



285 



Ser Cys Thr Leu Val Cys Pro Leu His Asn Gin Glu Val Thr Ala Glu 
290 295 300 



Asp Gly Thr Gin Arg Cys Glu Lys Cys Ser Lys Pro Cys Ala Arg Val 
305 310 315 320 



Cys Tyr Gly Leu Gly Met Glu His Leu Arg Glu Val Arg Ala Val Thr 
325 330 335 



Ser Ala Asn lie Gin Glu Phe Ala Gly Cys Lys Lys lie Phe Gly Ser 
340 345 ~ "* 350 



Leu Ala Phe Leu Pro Glu Ser Phe Asp Gly Asp Pro Ala Ser Asn Thr 
355 360 365 



Ala Pro Leu Gin Pro Glu Gin Leu Gin Val Phe Glu Thr Leu Glu Glu 
370 375 380 



lie Thr Gly Tyr Leu Tyr lie Ser Ala Trp Pro Asp Ser Leu Pro Asp 
385 390 395 400 



Leu Ser Val Phe Gin Asn Leu Gin Val lie Arg Gly Arg lie Leu His 
405 410 " ^ 415 



Asn Gly Ala Tyr Ser Leu Thr Leu Gin Gly Leu Gly lie Ser Trp Leu 
420 425 430 



Gly Leu Arg Ser Leu Arg Glu Leu Gly Ser Gly Leu Ala Leu lie His 
435 440 445 



His Asn Thr His Leu Cys Phe Val His Thr Val Pro Trp Asp Gin Leu 
450 455 460 



Phe Arg Asn Pro His Gin Ala Leu Leu His Thr Ala Asn Arg Pro Glu 
465 470 475 480 



Asp Glu Cys Val Gly Glu Gly Leu Ala Cys His Gin Leu Cys Ala Arg 
485 490 495 



Gly His Cys Trp Gly Pro Gly Pro Thr Gin Cys Val Asn 
500 505 



<210> 3 
<211> 500 



WO 03/014159 



6/21 



PCT/AU02/01042 



<212> PRT 

<213> Homo sapiens 

<400> 3 

Ser Glu Val Gly Asn Ser Gin Ala Val Cys Pro Gly Thr Leu Asn Gly 
1 5 10 15 



Leu Ser Val Thr Gly Asp Ala Glu Asn Gin Tyr Gin Thr Leu Tyr Lys 
20 25 30 



Leu Tyr Glu Arg Cys Glu Val Val Met Gly Asn Leu Glu lie Val Leu 
35 40 45 



Thr Gly His Asn Ala Asp Leu Ser Phe Leu Gin Trp He Arg Glu Val 
50 55 60 



Thr Gly Tyr Val Leu Val Ala Met Asn Glu Phe Ser Thr Leu Pro Leu 
65 70 75 80 



Pro Asn Leu Arg Val Val Arg Gly Thr Gin Val Tyr Asp Gly Lys Phe 
85 90 95 



Ala He Phe Val Met Leu Asn Tyr Asn Thr Asn Ser Ser His Ala Leu 
100 105 110 



Arg Gin Leu Arg Leu Thr Gin Leu Thr Glu He Leu Ser Gly Gly Val 
115 120 125 



Tyr He Glu Lys Asn Asp Lys Leu Cys His Met Asp Thr He Asp Trp 
130 135 140 



Arg Asp He Val Arg Asp Arg Asp Ala Glu He Val Val Lys Asp Asn 
145 *" 150 155 160 



Gly Arg Ser Cys Pro Pro Cys His Glu Val Cys Lys Gly Arg Cys Trp 
165 170 175 



Gly Pro Gly Ser Glu Asp Cys Gin Thr Leu Thr Lys Thr He Cys Ala 
180 185 190 



Pro Gin Cys Asn Gly His Cys Phe Gly Pro Asn Pro Asn Gin Cys Cys 
195 200 205 



His Asp Glu Cys Ala Gly Gly Cys Ser Gly Pro Gin Asp Thr Asp Cys 
210 215 220 
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Phe Ala Cys Arg His Phe Asn Asp Ser Gly Ala Cys Val Pro Arg Cys 
225 " " 230 235 240 



Pro Gin Pro Leu Val Tyr Asn Lys Leu Thr Phe Gin Leu Glu Pro Asn 
245 250 255 



Pro His Thr Lys Tyr Gin Tyr Gly Gly Val Cys Val Ala Ser Cys Pro 
260 265 270 



His Asn Phe Val Val Asp Gin Thr Ser Cys Val Arg Ala Cys Pro Pro 
275 280 285 



Asp Lys Met Glu Val Asp Lys Asn Gly Leu Lys Met Cys Glu Pro Cys 
290 295 300 



Gly Gly Leu Cys Pro Lys Ala Cys Glu Gly Thr Gly Ser Gly Ser Arg 
305 " " 310 315 320 



Phe Gin Thr Val Asp Ser Ser Asn He Asp Gly Phe Val Asn Cys Thr 
325 330 335 



Lys He Leu Gly Asn Leu Asp Phe Leu He Thr Gly Leu Asn Gly Asp 
340 345 350 



Pro Trp His Lys He Pro Ala Leu Asp Pro Glu Lys Leu Asn Val Phe 
355 360 365 



Arg Thr Val Arg Glu He Thr Gly Tyr Leu Asn He Gin Ser Trp Pro 
370 " 375 380 



Pro His Met His Asn Phe Ser Val Phe Ser Asn Leu Thr Thr He Gly 
385 390 395 400 



Gly Arg Ser Leu Tyr Asn Arg Gly Phe Ser Leu Leu He Met Lys Asn 
405 " 410 415 



Leu Asn Val Thr Ser Leu Gly Phe Arg Ser Leu Lys Glu He Ser Ala 
420 425 430 



Gly Arg He Tyr He Ser Ala Asn Arg Gin Leu Cys Tyr His His Ser 
435 440 445 



Leu Asn Trp Thr Lys Val Leu Arg Gly Pro Thr Glu Glu Arg Leu Asp 
450 * ^ 455 460 



He Lys His Asn Arg Pro Arg Arg Asp Cys Val Ala Glu Gly Lys Val 
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465 470 475 480 



Cys Asp Pro Leu Cys Ser Ser Gly Gly Cys Trp Gly Pro Gly Pro Gly 
485 490 495 



Gin Cys Leu Ser 
500 



<210> 4 

<211> 502 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Gin Pro Ser Asp Ser Gin Ser Val Cys Ala Gly Thr Glu Asn Lys Leu 
15 10 15 



Ser Ser Leu Ser Asp Leu Glu Gin Gin Tyr Arg Ala Leu Arg Lys Tyr 
20 25 30 



Tyr Glu Asn Cys Glu Val Val Met Gly Asn Leu Glu He Thr Ser He 
35 40 45 



Glu His Asn Arg Asp Leu Ser Phe Leu Arg Ser Val Arg Glu Val Thr 
50 55 60 



Gly Tyr Val Leu Val Ala Leu Asn Gin Phe Arg Tyr Leu Pro Leu Glu 
65 " 70 75 80 



Asn Leu Arg He He Arg Gly Thr Lys Leu Tyr Glu Asp Arg Tyr Ala 
85 90 95 



Leu Ala He Phe Leu Asn Tyr Arg Lys Asp Gly Asn Phe Gly Leu Gin 
100 ' 105 110 



Glu Leu Gly Leu Lys Asn Leu Thr Glu He Leu Asn Gly Gly Val Tyr 
115 120 125 



Val Asp Gin Asn Lys Phe Leu Cys Tyr Ala Asp Thr He His Trp Gin 
130 135 140 



Asp He Val Arg Asn Pro Trp Pro Ser Asn Leu Thr Leu Val Ser ,Thr 
145 150 155 160 



Asn Gly Ser Ser Gly Cys Gly Arg Cys His Lys Ser Cys Thr Gly Arg 
165 170 175 



WO 03/014159 



9/21 



PCT/AU02/01042 



Cys Trp Gly Pro Thr Glu Asn His Cys Gin Thr Leu Thr Arg Thr Val 
18 0 185 190 



Cys Ala Glu Gin Cys Asp Gly Arg Cys Tyr Gly Pro Tyr Val Ser Asp 
195 200 205 



Cys Cys His Arg Glu Cys Ala Gly Gly Cys Ser Gly Pro Lys Asp Thr 
210 215 220 



Asp Cys Phe Ala Cys Met Asn Phe Asn Asp Ser Gly Ala Cys Val Thr 
225 230 235 240 



Gin Cys Pro Gin Thr Phe Val Tyr Asn Pro Thr Thr Phe Gin Leu Glu 
245 250 255 



His Asn Phe Asn Ala Lys Tyr Thr Tyr Gly Ala Phe Cys Val Lys Lys 
260 265 270 



Cys Pro His Asn Phe Val Val Asp Ser Ser Ser Cys Val Arg Ala Cys 
275 280 285 



Pro Ser Ser Lys Met Glu Val Glu Glu Asn Gly He Lys Met Cys Lys 
290 295 300 



Pro Cys Thr Asp He Cys Pro Lys Ala Cys Asp Gly He Gly Thr Gly 
305 310 315 320 



Ser Leu Met Ser Ala Gin Thr Val Asp Ser Ser Asn He Asp Lys Phe 
325 330 335 



He Asn Cys Thr Lys He Asn Gly Asn Leu lie Phe Leu Val Thr Gly 
340 345 350 



He His Gly Asp Pro Tyr Asn Ala He Glu Ala He Asp Pro Glu Lys 
355 360 365 



Leu Asn Val Phe Arg Thr Val Arg Glu He Thr Gly Phe Leu Asn He 
370 375 380 



Gin Ser Trp Pro Pro Asn Met Thr Asp Phe Ser Val Phe Ser Asn Leu 
385 ~ 390 395 400 



Val Thr He Gly Gly Arg Val Leu Tyr Ser Gly Leu Ser Leu Leu He 
405 410 415 
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Leu Lys Gin Gin Gly lie Thr Ser Leu Gin Phe Gin Ser Leu Lys Glu 
420 " 425 430 



lie Ser Ala Gly Asn lie Tyr lie Thr Asp Asn Ser Asn Leu Cys Tyr 
435 ~ 440 445 



Tyr His Thr He Asn Trp Thr Thr Leu Phe Ser Thr He Asn Gin Arg 
450 455 460 



He Val He Arg Asp Asn Arg Lys Ala Glu Asn Cys Thr Ala Glu Gly 
465 470 475 480 



Met Val Cys Asn His Leu Cys Ser Ser Asp Gly Cys Trp Gly Pro Gly 
485 490 495 



Pro Asp Gin Cys Leu Ser 
500 



<210> 5 

<211> 53 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His 
15 10 15 



Asp Gly Val Cys Met Tyr He Glu Ala Leu Asp Lys Tyr Ala Cys Asn 
20 25 30 



Cys Val Val Gly Tyr He Gly Glu Arg Cys Gin Tyr Arg Asp Leu Lys 
35 ^ ~ 40 45 



Trp Trp Glu Leu Arg 
50 



<210> 6 

<211> 52 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Val Val Ser His Phe Asn Asp Cys Pro Asp Ser His Thr Gin Phe Cys 
15 10 15 



Phe His Gly Thr Cys Arg Phe Leu Val Gin Glu Asp Lys Pro Ala Cys 
20 25 30 
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Val Cys His Ser Gly Tyr Val Gly Ala Arg Cys Glu His Ala Asp Leu 
35 40 45 



Leu Ala Val Val 
50 



<210> 7 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<400> 7 

Lys Lys Lys Asn Pro Cys Asn Ala Glu Phe Gin Asn Phe Cys lie His 
1 ^ " 5 10 15 



Gly Glu Cys Lys Tyr He Glu His Leu Glu Ala Val Thr Cys Lys Cys 
20 25 30 



Gin Gin Glu Tyr Phe Gly Glu Arg Cys Gly Glu Lys 
35 40 



<210> 8 

<211> 50 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Lys Lys Arg Asp Pro Cys Leu Arg Lys Tyr Lys Asp Phe Cys He His 
1 5 10 15 



Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg Ala Pro Ser Cys He Cys 
20 25 30 



His Pro Gly Tyr His Gly Glu Arg Cys His Gly Leu Ser Leu Pro Val 
35 40 45 



Glu Asn 
50 



<210> 9 

<211> 50 

<212> PRT 

<213> Homo sapiens 

<400> 9 

Gly His Phe Ser Arg Cys Pro Lys Gin Tyr Lys His Tyr Cys He Lys 
1 5 10 15 
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Gly Arg Cys Arg Phe Val Val Ala Glu Gin Thr Pro Ser Cys Val Cys 
20 25 30 



Asp Glu Gly Tyr He Gly Ala Arg Cys Glu Arg Val Asp Leu Phe Tyr 
35 40 45 



Leu Arg 
50 



<210> 10 

<211> 50 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Val Ser He Thr Lys Cys Ser Ser Asp Met Asn Gly Tyr Cys Leu His 
15 10 15 



Gly Gin Cys He Tyr Leu Val Asp Met Ser Gin Asn Tyr Cys Arg Cys 
20 25 30 



Glu Val Gly Tyr Thr Gly Val Arg Cys Glu His Phe Phe Leu Thr Val 
35 40 ' 45 



His Gin 
50 



<210> 11 

<211> 50 

<212> PRT 

<213> Homo sapiens 

<400> 11 

Lys Phe Ser His Pro Cys Leu Glu Asp His Asn Ser Tyr Cys He Asn 
15 10 15 



Gly Ala Cys Ala Phe His His Glu Leu Lys Gin Ala He Cys Arg Cys 
20 25 30 



Phe Thr Gly Tyr Thr Gly Gin Arg Cys Glu His Leu Thr Leu Thr Ser 
35 40 45 



Tyr Ala 
50 



<210> 12 
<211> 53 
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<212> PRT 

<213> Homo sapiens 

<400> 12 

Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 



Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 



Leu Cys Lys Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn 
35 " 40 45 



Val Pro Met Lys Val 
50 



<210> 13 

<211> 53 

<212> PRT 

<213> Homo sapiens 

<400> 13 

Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 



Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 



Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 * 40 45 



Val Met Ala Ser Phe 
50 



<210> 14 

<211> 50 

<212> PRT 

<213> Homo sapiens 

<400> 14 

Gly His Ala Arg Lys Cys Asn Glu Thr Ala Lys Ser Tyr Cys Val Asn 
1 5 10 15 



Gly Gly Val Cys Tyr Tyr He Glu Gly He Asn Gin Leu Ser Cys Lys 
20 25 30 



Cys Pro Asn Gly Phe Phe Gly Gin Arg Cys Leu Glu Lys Leu Pro Leu 
35 40 45 
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Arg Leu 
50 



<210> 15 

<211> 50 

<212> PRT . 

<213> Homo sapiens 

<400> 15 



Gly His Ala Arg Lys Cys Asn Glu Thr Ala Lys Ser Tyr Cys Val Asn 
1 5 10 15 



Gly Gly Val Cys Tyr Tyr lie Glu Gly lie Asn Gin Leu Ser Cys Lys 
2 0 ~ 25 30 



Cys Pro Val Gly Tyr Thr Gly Asp Arg Cys Gin Gin Phe Ala Met Val 
35 ~ ~ 40 45 



Asn Phe 
50 



<210> 16 

<211> 52 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Glu His Phe Lys Pro Cys Arg Asp Lys Asp Leu Ala Tyr Cys Leu Asn 
1 5 10 15 



Asp Gly Glu Cys Phe Val He Glu Thr Leu Thr Gly Ser His Lys His 
20 25 30 



Cys Arg Cys Lys Glu Gly Tyr Gin Gly Val Arg Cys Asp Gin Phe Leu 
35 " 40 45 



Pro Lys Thr Asp 
50 



<210> 17 

<211> 50 

<212> PRT 

<213> Homo sapiens 

<400> 17 

Asp His Glu Gin Pro Cys Gly Pro Arg His Arg Ser Phe Cys Leu Asn 
1 5 10 15 
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Gly Gly lie Cys Tyr Val lie Pro Thr lie Pro Ser Pro Phe Cys Arg 
20 " 25 30 



Cys lie Glu Asn Tyr Thr Gly Ala Arg Cys Glu Glu Val Phe Leu Pro 
35 40 45 



Ser Ser 
50 



<210> 18 

<211> 186 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Leu Glu Glu Lys Lys Val Cys Gin Gly Thr Ser Asn Lys Leu Thr Gin 
1 J 5 10 15 



Leu Gly Thr Phe Glu Asp His Phe Leu Ser Leu Gin Arg Met Phe Asn 
20 25 30 



Asn Cys Glu Val Val Leu Gly Asn Leu Glu lie Thr Tyr Val Gin Arg 
35 40 45 



Asn Tyr Asp Leu Ser Phe Leu Lys Thr He Gin Glu Val Ala Gly Tyr 
50 '55 60 



Val Leu He Ala Leu Asn Thr Val Glu Arg He Pro Leu Glu Asn Leu 
65 70 75 80 



Gin He He Arg Gly Asn Met Tyr Tyr Glu Asn Ser Tyr Ala Leu Ala 
85 90 95 



Val Leu Ser Asn Tyr Asp Ala Asn Lys Thr Gly Leu Lys Glu Leu Pro 
100 105 110 



Met Arg Asn Leu Gin Glu He Leu His Gly Ala Val Arg Phe Ser Asn 
115 120 125 



Asn Pro Ala Leu Cys Asn Val Glu Ser He Gin Trp Arg Asp He Val 
130 135 140 



Ser Ser Asp Phe Leu Ser Asn Met Ser Met Asp Phe Gin Asn His Leu 
145 150 155 160 
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Gly Ser Cys Gin Lys Cys Asp Pro Ser Cys Pro Asn Gly Ser Cys Trp 
165 " 170 175 



Gly Ala Gly Glu Glu Asn Cys Gin Lys Leu 
180 185 



<210> 19 

<211> 183 

<212> PRT 

<213> Homo sapiens 

<400> 19 

Glu He Cys Gly Pro Gly He Asp He Arg Asn Asp Tyr Gin Gin Leu 
15 10 15 



Lys Arg Leu Glu Asn Cys Thr Val He Glu Gly Tyr Leu His He Leu 
20 25 30 



Leu He Ser Lys Ala Glu Asp Tyr Arg Ser Tyr Arg Phe Pro Lys Leu 
35 40 45 



Thr Val He Thr Glu Tyr Leu Leu Leu Phe Arg Val Ala Gly Leu Glu 
50 55 60 



Ser Leu Gly Asp Leu Phe Pro Asn Leu Thr Val He Arg Gly Trp Lys 
65 7 0 75 8 0 



Leu Phe Tyr Asn Tyr Ala Leu Val He Phe Glu Met Thr Asn Leu Lys 
85 90 95 



Asp He Gly Leu Tyr Asn Leu Arg Asn He Thr Arg Gly Ala He Arg 
100 105 110 



He Glu Lys Asn Ala Asp Leu Cys Tyr Leu Ser Thr Val Asp Trp Ser 
115 120 125 



Leu He Leu Asp Ala Val Ser Asn Asn Tyr He Val Gly Asn Lys Pro 
130 * 135 140 



Pro Lys Glu Cys Gly Asp Leu Cys Pro Gly Thr Met Glu Glu Lys Pro 
145 " 150 155 160 



Met Cys Glu Lys Thr Thr He Asn Asn Glu Tyr Asn Tyr Arg Cys Trp 
165 170 175 



Thr Thr Asn Arg Cys Gin Lys 
180 



WO 03/014159 



17/21 



PCT/AU02/01042 



<210> 20 

<211> 161 

<212> PRT 

<213> Homo sapiens 

<400> 20 

Lys Val Cys Glu Glu Glu Lys Lys Thr Lys Thr lie Asp Ser Val Thr 
1 5 10 15 



Ser Ala Gin Met Leu Gin Gly Cys Thr He Phe Lys Gly Asn Leu Leu 
20 25 30 



He Asn He Arg Arg Gly Asn Asn He Ala Ser Glu Leu Glu Asn Phe 
35 ~ 40 45 



Met Gly Leu He Glu Val Val Thr Gly Tyr Val Lys He Arg His Ser 
50 55 60 



His Ala Leu Val Ser Leu Ser Phe Leu Lys Asn Leu Arg Leu He Leu 
65 70 75 80 



Gly Glu Glu Gin Leu Glu Gly Asn Tyr Ser Phe Tyr Val Leu Asp Asn 
85 90 95 



Gin Asn Leu Gin Gin Leu Trp Asp Trp Asp His Arg Asn Leu Thr He 
100 105 110 



Lys Ala Gly Lys Met Tyr Phe Ala Phe Asn Pro Lys Leu Cys Val Ser 
115 ~ 12 0 125 



Glu He Tyr Arg Met Glu Glu Val Thr Gly Thr Lys Gly Arg Gin Ser 
130 ~ 135 140 



Lys Gly Asp He Asn Thr Arg Asn Asn Gly Glu Arg Ala Ser Cys Glu 
145 ^ 150 155 160 



Ser 



<210> 21 

<211> 191 

<212> PRT 

<213> Homo sapiens 

<400> 21 



Lys Val Cys Asn Gly He Gly He Gly Glu Phe Lys Asp Ser Leu Ser 
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10 15 



lie Asn Ala Thr Asn lie Lys His Phe Lys Asn Cys Thr Ser lie Ser 
20 25 ~ 30 



Gly Asp Leu His lie Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr 
35 40 45 



His Thr Pro Pro Leu Asp Pro Gin Glu Leu Asp lie Leu Lys Thr Val 
50 55 60 



Lys Glu lie Thr Gly Phe Leu Leu lie Gin Ala Trp Pro Glu Asn Arg 
65 70 75 80 



Thr Asp Leu His Ala Phe Glu Asn Leu Glu lie lie Arg Gly Arg Thr 
85 90 95 



Lys Gin His Gly Gin Phe Ser Leu Ala Val Val Ser Leu Asn lie Thr 
100 105 110 



Ser Leu Gly Leu Arg Ser Leu Lys Glu lie Ser Asp Gly Asp Val lie 
115 120 125 



lie Ser Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr lie Asn Trp Lys 
130 " 135 ■ 140 



Lys Leu Phe Gly Thr Ser Gly Gin Lys Thr Lys lie lie Ser Asn Arg 
145 ^ 150 155 160 



Gly Glu Asn Ser Cys Lys Ala Thr Gly Gin Val Cys His Ala Leu Cys 
165 170 175 



Ser Pro Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser 
180 185 190 



<210> 22 

<211> 124 

<212> PRT 

<213> Homo sapiens 

<400> 22 

Thr Lys He He Cys Ala Gin Gin Cys Ser Gly Arg Cys Arg Gly Lys 
15 10 15 



Ser Pro Ser Asp Cys Cys His Asn Gin Cys Ala Ala Gly Cys Thr Gly 
20 25 30 
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Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp Glu Ala 
35 40 ~ 45 



Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro Thr Thr 
50 55 60 



Tyr Gin Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly Ala Thr 
65 70 75 80 



Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His Gly Ser 
85 " 90 95 



Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu Asp Gly 
100 105 110 



Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg 

120 





115 


<210> 


23 


<211> 


116 


<212> 


PRT 


<213> 


Homo 


<400> 


23 



Met Cys Pro Ser Thr Cys Gly Lys Arg Ala Cys Thr Glu Asn Asn Glu 
15 10 15 



Cys Cys His Pro Glu Cys Leu Gly Ser Cys Ser Ala Pro Asp Asn Asp 
20 "* 25 " 30 



Thr Ala Cys Val Ala Cys Arg His Tyr Tyr Tyr Ala Gly Val Cys Val 
35 40 45 



Pro Ala Cys Pro Pro Asn Thr Tyr Arg Phe Glu Gly Trp Arg Cys Val 
50 55 60 



Asp Arg Asp Phe Cys Ala Asn lie Leu Ser Ala Glu Ser Ser Asp Ser 
65 ~ 70 75 80 



Glu Gly Phe Val lie His Asp Gly Glu Cys Met Gin Glu Cys Pro Ser 
85 ~ 90 95 



Gly Phe lie Arg Asn Gly Ser Gin Ser Met Tyr Cys lie Pro Cys Glu 
100 105 110 



Gly Pro Cys Pro 
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115 


<210> 


24 


<211> 


121 


<212> 


PRT 


<213> 


Homo 


<400> 


24 



Ser Cys Arg Asn Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Lys 
15 10 15 



Leu Leu Glu Gly Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys lie 
20 " 25 30 



Gin Cys His Pro Glu Cys Leu Pro Gin Ala Met Asn He Thr Cys Thr 
35 " 40 45 



Gly Arg Gly Pro Asp Asn Cys He Gin Cys Ala His Tyr He Asp Gly 
50 55 60 



Pro His Cys Val Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn 
65 70 75 80 



Thr Leu Val Trp Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys 
85 90 95 



His Pro Asn Cys Thr Tyr Gly Cys Thr Gly Pro Gly Leu Glu Gly Cys 
100 105 110 



120 





115 


<210> 


25 


<211> 


50 


<212> 


PRT 


<213> 


Homo 


<400> 


25 



Val Val Ser His Phe Asn Asp Cys Pro Asp Ser His Thr Gin Phe Cys 
15 10 15 



Phe His Gly Thr Cys Arg Phe Leu Val Gin Glu Asp Lys Pro Ala Cys 
20 25 30 



Val Cys His Ser Gly Tyr Val Gly Ala Arg Cys Glu His Ala Asp Leu 
35 40 45 
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Leu Ala 
50 



<210> 26 

<211> 53 

<212> PRT 

<213> Homo sapiens 

<400> 26 



Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His 
1 5 10 15 



Asp Gly Val Cys Met Tyr He Glu Ala Leu Asp Lys Tyr Ala Cys Asn 
20 25 30 



Cys Val Val Gly Tyr He Gly Glu Arg Cys Gin Tyr Arg Asp Leu Lys 
35 40 45 



Trp Trp Glu Leu Arg 
50 



<210> 27 

<211> 48 

<212> PRT 

<213> Homo sapiens 

<400> 27 

Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg Lys Tyr Lys Asp Phe Cys 
1 5 10 ^ 15 



He His Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg Ala Pro Ser Cys 
20 25 30 



He Cys His Pro Gly Tyr His Gly Glu Arg Cys His Gly Leu Ser Leu 
35 40 45 
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APPENDIX I 



RYST1 51.590 198.710 78.900 90.00 102.03 90.00 P 21 
REMARK Back-to-back 
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32. 


63 


AAAA 


44. 


003 


16. 


296 


1. 


00 


31. 


13 


AAAA 


44. 


889 


17. 


383 


1. 


00 


28. 


76 


AAAA 


44. 


969 


18. 


,504 


1. 


.00 


27. 


83 


AAAA 


44. 


325 


17. 


860 


1. 


,00 


29. 


92 


AAAA 


45. 


810 


14. 


.830 


1. 


.00 


34, 


82 


AAAA 


45. 


863 


14 . 


,089 


1. 


.00 


36. 


01 


AAAA 


46. 


842 


14. 


,994 


1. 


.00 


36. 


.10 


AAAA 


48. 


.064 


14. 


.252 


1. 


.00 


37. 


.88 


AAAA 


48. 


,274 


13, 


.234 


1. 


.00 


37, 


.50 


AAAA 


47. 


,184 


12. 


.196 


1. 


.00 


40. 


.04 


AAAA 


47. 


,377 


11. 


.148 


1. 


.00 


42. 


.95 


AAAA 


48. 


,484 


10. 


.646 


1. 


.00 


45. 


.75 


. AAAA 


46. 


,296 


10. 


.792 


1, 


.00 


43. 


.71 


• AAAA 


49. 


,321 


15. 


.070 


1, 


.00 


38. 


.45 


AAAA 


50. 


,320 


14. 


.547 


1, 


.00 


39. 


.48 


AAAA 


49. 


.265 


16. 


.353 


1, 


.00 


38. 


.74 


AAAA 


50. 


.427 


17. 


.210 


1, 


.00 


38. 


.23 


AAAA 


51. 


.251 


17. 


.164 


1. 


.00 


38. 


.15 


AAAA 


52. 


,448 


18. 


.097 


1, 


.00 


39. 


, 43 


AAAA 


53. 


.542 


17. 


.682 


1 


.00 


41. 


.05 


AAAA 


54. 


.009 


16. 


.529 


1, 


.00 


42, 


.73 


AAAA 


53. 


.937 


18. 


.505 


1 


.00 


39, 


.66 


AAAA 


50. 


.082 


18, 


.660 


1 


.00 


38, 


.25 


AAAA 


49. 


.212 


19, 


.278 


1 


.00 


38, 


.68 


AAAA 


50. 


.768 


19, 


.181 


1. 


.00 


36, 


.89 


AAAA 


50, 


.628 


20 


.563 


1 


.00 


35. 


.34 


AAAA 


50. 


.153 


20 


.661 


1 


.00 


36, 


.18 


AAAA 


50. 


.107 


22 


.111 


1 


.00 


35. 


.24 . 


AAAA 


48. 


.767- 


20 


.034 


1 


.00 


37. 


.14 


. AAAA 


48 


.187 


20 


.146 


1 


.00 


36. 


. 91 


AAAA 


52 


.052 


21 


.101 


1 


.00 


34 


.22 


AAAA 


52. 


.831 


21 


.005 


1 


.00 


32 


.90 


AAAA 


52 


.388 


21 


.646 


1 


.00 


33, 


.33 


AAAA 


53 


.716 


22 


.178 


1 


.00 


33 


.63 


AAAA 


53 


.713 


22 


. 919 


1 


.00 


31 


.58 


AAAA 


53 


.218 


22 


.058 


1 


.00 


30 


.79 


AAAA 


53 


.174 


22 


.855 


1 


.00 


29 


.83 


AAAA 


54 


.118 


20 


.871 


1 


.00 


30 


.80 


AAAA 


54 


.229 


23 


.055 


1 


.00 


35 


.35 


AAAA 


54 


.581 


22 


.540 


1 


.00 


35 


.42 


AAAA 


54 


.277 


24 


.366 


1 


.00 


37 


.12 


AAAA 


54 


.756 


25 


.251 


1 


.00 


38 


.24 


AAAA 


55 


.154 


26 


.623 


1 


.00 


39 


.82 


AAAA 


55 


.942 


26 


.541 


1 


.00 


41 


.46 


AAAA 


57 


.233 


26 


.846 


1 


.00 


43 


.26 


AAAA 


55 


.404 


26 


.077 


1 


. 00 


42 


.51 


AAAA 


56 


.329 


26 


.096 


1 


.00 


42 


.63 


AAAA 


57 


.449 


26 


.557 


1 


.00 


42 


.96 


■ AAAA 


53 


.607 


25 


.422 


1 


.00 


38 


.60 


AAAA 


52.452 


25 


.257 


1 


.00 


39 


.83 


AAAA 


53.908 


25 


.713 


1 


.00 


38 


.05 


AAAA 
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ATOM 


971 


CA 


GLY A 122 


19.564 


52.852 


25. 937 


1.00 36.52 


AAAA 


ATOM 


972 


C 


GLY A 122 


20.525 


52.413 


24.831 


1.00 35.69 


AAAA 


ATOM 


973 


O 


GLY A 122 


20.175 


52.400 


23.657 


1.00 35.02 


AAAA 


ATOM 


974 


N 


ALA A 123 


21.741 


52.018 


25.206 


1.00 35.54 


AAAA 


ATOM 


975 


CA 


ALA A 123 


22.745 


51.593 


24.220 


1.00 35.46 


AAAA 


ATOM 


97 6 


CB 


ALA A 123 


24.096 


52.193 


24.570 


1.00 35.63 


AAAA 


ATOM 


977 


C 


ALA A 123 


22.891 


50.075 


24.048 


1.00 35.18 


AAAA 


ATOM 


978 


O 


ALA A 123 


22.214 


49.302 


24.717 


1.00 34.77 


AAAA 


ATOM 


979 


N 


VAL A 124 


23.786 


49.661 


23.147 


1.00 34.53 


AAAA 


ATOM 


980 


CA 


VAL A 124 


24.026 


48.241 


22.877 


1.00 33.64 


AAAA 


ATOM 


981 


CB 


VAL A 124 


23.438 


47.845 


21.500 


1.00 33.47 


AAAA 


ATOM 


982 


CGI 


VAL A 124 


23.556 


46.353 


21.289 


1.00 34.26 


AAAA 


ATOM 


983 


CG2 


VAL A 124 


21.981 


48.266 


21.417 


1.00 32.39 


AAAA 


ATOM 


984 


C 


VAL A 124 


25.523 


47.878 


22.922 


1.00 33.14 


AAAA 


ATOM 


985 


O 


VAL A 124 


26.380 


48.699 


22.612 


1.00 32.68 


AAAA 


ATOM 


986 


N 


ARG A 125 


25.827 


46.642 


23.315 


1.00 32. 99 


AAAA 


ATOM 


987 


CA 


ARG A 125 


27.214 


46.174 


23.419 


1.00 32.12 


AAAA 


ATOM 


988 


CB 


ARG A 125 


27.705 


46.338 


24.858 


1.00 30. 91 


AAAA 


ATOM 


989 


CG 


ARG A 125 


29.027 


45.679 


25.174 


1.00 30.31 


AAAA 


ATOM 


990 


CD 


ARG A 125 


29.440 


45.966 


26. 613 


1.00 30. 90 


AAAA 


ATOM 


991 


NE 


ARG A 125 


30.263 


44.903 


27 .186 


1.00 33.10 


AAAA 


ATOM 


992 


CZ 


ARG A 125 


31.4 95 


44.596 


26.789 


1.00 34.73 


AAAA 


ATOM 


993 


NH1 


ARG A 125 


32.070 


45.276 


25.808 


1.00 37.47 


AAAA 


ATOM 


994 


NH2 


ARG A 125 


32.153 


43.598 


27.368 


1.00 34.41 


AAAA 


ATOM 


995 


C 


ARG A 125 


27.394 


44.718 


22.995 


1.00 32.55 


AAAA 


ATOM 


996 


O 


ARG A 125 


26.550 


43.867 


23.276 


1.00 32.66 


AAAA 


ATOM 


997 


N 


PHE A 126 


28.500 


44.446 


22.311 


1.00 33.29 


AAAA 


ATOM 


998 


CA 


PHE A 126 


28.838 


43.097 


21.856 


1.00 33.81 


AAAA 


ATOM 


999 


CB 


PHE A 126 


28.693 


42.969 


20.325 


1.00 34.48 


AAAA 


ATOM 


1000 


CG 


PHE A 126 


27.313 


42.558 


19.857 


1.00 34.94 


AAAA 


ATOM 


1001 


CD1 


PHE A 126 


26.272 


43.482 


19.797 


1.00 34.51 


AAAA 


ATOM 


1002 


CD2 


PHE A 126 


27.057 


41.242 


19.478 


1.00 34.48 


AAAA 


ATOM 


1003 


CE1 


PHE A 126 


24.999 


43.102 


19.366 


1.00 33.90 


AAAA 


ATOM 


1004 


CE2 


PHE A 126 


25.788 


40.855 


19.048 


1.00 34.28 


AAAA 


ATOM 


1005 


CZ 


PHE A 126 


24.758 


41.789 


18.992 


1.00 33.85 


AAAA 


ATOM 


1006 


C 


PHE A 126 


30.290 


42.786 


22.232 


1.00 34.31 


AAAA 


ATOM 


1007 


O 


PHE A 126 


31.115 


43.695 


22.353 


1.00 35.00 


AAAA 


ATOM 


1008 


N 


SER A 127 


30.590 


41.503 


22.422 


1.00 34.4 6 


AAAA 


ATOM 


1009 


CA 


SER A 127 


31.946 


41.042 


22.742 


1.00 34.03 


AAAA 


ATOM 


1010 


CB 


SER A 127 


32.572 


41.880 


23.866 


1.00 35.24 


AAAA 


ATOM 


1011 


OG 


SER A 127 


31.866 


41.741 


25.080 


1.00 39.63 


AAAA 


ATOM 


1012 


C 


SER A 127 


32.005 


39.558 


23.108 


1.00 31.55 


AAAA 


ATOM 


1013 


O 


SER A 127 


31.211 


39.068 


23.906 


1.00 29.87 


AAAA 


ATOM 


1014 


N 


ASN A 128 


32.962 


38.864 


22.500 


1.00 30.22 


PJ\Ah 


ATOM 


1015 


CA 


ASN A 128 


33.195 


37.438 


22.689 


1.00 30.10 


AAAA 


ATOM 


1016 


CB 


ASN A 128 


33.532 


37.139 


24.153 


1,00 31.19 


AAAA 


ATOM 


1017 


CG 


ASN A 128 


34-819 


37.827 


24.617 


1.00 32.24 


AAAA 


ATOM 


1018 


OD1 


ASN A 128 


34.783 


38.733 


25. 454 


1.00 34.03 


AAAA 


ATOM 


1019 


ND2 


: ASN A 128 


35. 957 


37.398 


24.075 


1.00 31.76 


AAAA 


ATOM 


1020 


C 


ASN A 128 


32.055 


36.537 


22 .198 


1.00 30.09 


AAAA 


ATOM 


1021 


O 


ASN A 128 


31.381 


35.859 


22.971 


1.00 28.96 


AAAA 


ATOM 


1022 


N 


ASN A 129 


31.860 


36.549 


20.885 


1.00 30.76 


AAAA 


ATOM 


1023 


CA 


ASN A 129 


30. 857 


35.738 


20.203 


1.00 30.82 


AAAA 


ATOM 


1024 


CB 


ASN A 129 


29.670 


36.596 


19.766 


1.00 30.29 


AAAA 


ATOM 


1025 


CG 


ASN A 129 


29.047 


37.351 


20. 905 


1.00 30.62 


AAAA 


ATOM 


1026 


OD1 ASN A 129 


28.534 


36.756 


21.847 


1.00 31.15 


AAAA 


ATOM 


1027 


ND2 ASN A 129 


29.086 


38.677 


20.826 


1.00 29.57 


AAAA 


ATOM 


1028 


C 


ASN A 129 


31.582 


35.225 


18.957 


1.00 31.45 


AAAA 


ATOM 


1029 


O 


ASN A 129 


31.215 


35.559 


17.837 


1.00 32.78 


AAAA 


ATOM 


1030 


N 


PRO A 130 


32.624 


34.407 


19.138 


1.00 31.25 


AAAA 


ATOM 


1031 


CD 


PRO A 130 


33.289 


34.048 


20.396 


1.00 30.90 


AAAA 


ATOM 


1032 


CA 


PRO A 130 


33.376 


33.884 


18.002 


1.00 32.41 


AAAA 


ATOM 


1033 


CB 


PRO A 130 


34.607 


33.281 


18.670 


1.00 32.54 


AAAA 


ATOM 


1034 


CG 


PRO A 130 


34.706 


34.039 


19.974 


1.00 31.82 


AAAA 


ATOM 


1035 


C 


PRO A 130 


32.634 


32.875 


17.127 


1.00 33.71 


AAAA 


ATOM 


1036 


O 


PRO A 130 


33.219 


31.905 


16.651 


1.00 35.01 


AAAA 


ATOM 


1037 


N 


ALA A 131 


31.354 


33.109 


16.893 


1.00 33.82 


AAAA 


ATOM 


1038 


CA 


ALA A 131 


30.585 


32.193 


16.078 


1.00 34.58 


AAAA 


ATOM 


1039 


CB 


ALA A 131 


30.143 


31.021 


16.923 


1.00 35.10 


AAAA 


ATOM 


1040 


C 


ALA A 131 


29.377 


32.904 


15.489 


1.00 35.70 


AAAA 


ATOM 


1041 


O 


ALA A 131 


28.722 


32.407 


14.570 


1.00 35.03 


AAAA 


ATOM 


1042 


N 


LEU A 132 


29.095 


34.081 


16.030 


1.00 36.27 


AAAA 


ATOM 


1043 


CA 


LEU A 132 


27.965 


34.870 


15.591 


1.00 36.85 


AAAA 


ATOM 


1044 


CB 


LEU A 132 


27.874 


36.132 


16.444 


1.00 33.38 


AAAA 


ATOM 


1045 


CG 


LEU A 132 


26.587 


36.928 


16.293 


1.00 33.30 


AAAA 
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ATOM 


1046 


CD1 


LEU 


A 


132 


25.393 


36.041 


16.562 


1. 


00 


32.65 


AAAA 


ATOM 


1047 


CD2 


LEU 


A 


132 


26.607 


38. 


097 


17. 


241 


1.00 


33.69 


AAAA 


ATOM 


1048 


C 


LEU 


A 


132 


28. 


165 


35.222 


14. 


127 


1. 


00 


39.24 


AAAA 


ATOM 


1049 


0 


LEU 


A 


132 


29.201 


35. 


772 


13. 


766 


1. 


00 


41. 


29 


AAAA 


ATOM 


1050 


N 


CYS 


A 


133 


27.188 


34. 


892 


13. 


282 


1. 


00 


41. 


16 


AAAA 


ATOM 


1051 


CA 


CYS 


A 


133 


27. 


264 


35. 


191 


11. 


848 


1. 


00 


41. 


90 


AAAA 


ATOM 


1052 


C 


CYS 


A 


133 


26. 


4 05 


36. 


369 


11. 


421 


1. 


00 


41. 


88 


AAAA 


ATOM 


1053 


o 


CYS 


A 


133 


25.311 


36. 


582 


11. 


941 


1. 


00 


41. 


43 


AAAA 


ATOM 


1054 


CB 


CYS 


A 


133 


26. 


815 


34. 


002 


10. 


999 


1. 


00 


42. 


67 


AAAA 


ATOM 


1055 


SG 


CYS 


A 


133 


28. 


000 


32. 


654 


10. 


741 


1. 


00 


47. 


30 


AAAA 


ATOM 


1056 


N 


ASM 


A 


134 


26. 


918 


37. 


118 


10. 


454 


1. 


00 


42. 


64 


AAAA 


ATOM 


1057 


CA 


ASN 


A 


134 


26. 


204 


38.242 


9. 


874 


1. 


00 


44. 


09 


AAAA 


ATOM 


1058 


CB 


ASN 


A 


134 


25. 


002 


37. 


724 


9. 


077 


1. 


00 


44. 


56 


AAAA 


ATOM 


1059 


CG 


ASN 


A 


134 


25. 


404 


37. 


075 


7. 


770 


1. 


00 


45. 


17 


AAAA 


ATOM 


1060 


OD1 


ASN 


A 


134 


25. 


495 


35. 


852 


7. 


674 


1. 


00 


44. 


80 


AAAA 


ATOM 


1061 


ND2 


ASN 


A 


134 


25. 


661 


37. 


900 


6. 


754 


1. 


00 


44 . 


86 


AAAA 


ATOM 


1062 


C 


ASN 


A 


134 


25. 


718 


39. 


373 


10. 


764 


1. 


00 


44. 


64 


AAAA 


ATOM 


1063 


O 


ASN 


A 


134 


24. 


926 


40. 


195 


10. 


306 


1. 


00 


44. 


22 


AAAA 


ATOM 


1064 


N 


VAL 


A 


135 


26. 


155 


39. 


439 


12. 


017 


1. 


00 


45. 


70 


AAAA 


ATOM 


1065 


CA 


VAL 


A 


135 


25. 


674 


40. 


524 


12. 


870 


1. 


00 


45. 


89 


AAAA 


ATOM 


1066 


CB 


VAL 


A 


135 


25. 


519 


40. 


068 


14. 


343 


1. 


00 


46. 


97 


AAAA 


ATOM 


1067 


CGI 


VAL 


A 


135 


25. 


587 


41. 


260 


15. 


292 


1. 


00 


48. 


48 


AAAA 


ATOM 


1068 


CG2 


VAL 


A 


135 


24 . 


164 


39. 


384 


14. 


508 


1. 


00 


46. 


88 


AAAA 


ATOM 


1069 


C 


VAL 


A 


135 


26. 


581 


41. 


732 


12. 


745 


1. 


00 


44 . 


55 


AAAA 


ATOM 


1070 


O 


VAL 


A 


135 


26. 


204 


42. 


847 


13. 


081 


1. 


00 


44. 


11 


AAAA 


ATOM 


1071 


N 


GLU 


A 


136 


27. 


772 


41. 


496 


12. 


225 


1. 


00 


43. 


51 


AAAA 


ATOM 


1072 


CA 


GLU 


A 


136 


28. 


733 


42. 


.557 


11. 


996 


1. 


00 


43. 


67 


AAAA 


ATOM 


1073 


CB 


GLU 


A 


136 


30. 


140 


41. 


■963 


11. 


,975 


1. 


00 


45, 


50 


AAAA 


ATOM 


1074 


CG 


GLU 


A 


136 


30. 


186 


40. 


,546 


11. 


,410 


1. 


00 


47. 


,80 


AAAA 


ATOM 


1075 


CD 


GLU 


A 


136 


29. 


509 


40. 


,422 


10. 


.049 


1. 


.00 


49. 


,63 


AAAA 


ATOM 


1076 


OE1 


GLU 


A 


136 


30, 


,186 


40. 


,632 


9. 


.018 


1. 


00 


48. 


,59 


AAAA 


ATOM 


1077 


OE2 


GLU 


A 


136 


28. 


292 


40. 


,125 


10. 


.014 


1. 


.00 


50, 


,58 


AAAA 


ATOM 


1078 


C 


GLU 


A 


136 


28. 


,410 


43. 


.191 


10, 


.639 


1. 


.00 


42. 


.78 


AAAA 


ATOM 


1079 


o 


GLU 


A 


136 


29. 


.254 


43. 


.247 


9. 


.754 


1. 


.00 


44 . 


.13 


AAAA 


ATOM 


1080 


N 


SER 


A 


137 


27. 


.186 


43. 


.668 


10. 


.465 


1. 


.00 


41. 


.95 


AAAA 


ATOM 


1081 


CA 


SER 


A 


137 


26. 


812 


44. 


.260 


9. 


.191 


1. 


.00 


41. 


.46 


AAAA 


ATOM 


1082 


CB 


SER 


A 


137 


26. 


.464 


43. 


.161 


8. 


.199 


1. 


.00 


41. 


.19 


AAAA 


ATOM 


1083 


OG 


SER 


A 


137 


25. 


.206 


42. 


.596 


8. 


.522 


1. 


.00 


40, 


.22 


AAAA 


ATOM 


1084 


C 


SER 


A 


137 


25. 


.611 


45. 


.178 


9, 


.341 


1. 


.00 


41, 


.40 


AAAA 


ATOM 


1085 


O 


SER 


A 


137 


25. 


.389 


46, 


.076 


8. 


.528 


1, 


.00 


40. 


.95 


AAAA 


ATOM 


1086 


N 


ILE 


A 


138 


24. 


.824 


44, 


.933 


10. 


.377 


1. 


.00 


40. 


.68 


AAAA 


ATOM 


1087 


CA 


ILE 


A 


138 


23. 


.652 


45. 


.738 


10. 


.625 


1. 


.00 


40. 


.30 


AAAA 


ATOM 


1088 


CB 


ILE 


A 


138 


22. 


.897 


45, 


.223 


11. 


.859 


1. 


.00 


40. 


.33 


AAAA 


ATOM 


1089 


CG2 


ILE 


A 


138 


21. 


.850 


46, 


.234 


12. 


.314 


1. 


.00 


41. 


.04 


AAAA 


ATOM 


1090 


CGI 


ILE 


A 


138 


22. 


.275 


43. 


.871 


11. 


.541 


1. 


.00 


38. 


.98 


AAAA 


ATOM 


1091 


CD1 


ILE 


A 


138 


23. 


.291 


42, 


.781 


11. 


.428 


1. 


.00 


39. 


.86 


AAAA 


ATOM 


1092 


C 


ILE 


A 


138 


24 . 


.038 


47, 


.194 


10. 


.851 


1. 


.00 


40. 


.77 


AAAA 


ATOM 


1093 


O 


ILE 


A 


138 


24. 


.973 


47. 


.486 


11. 


.597 


1. 


.00 


39. 


.73 


AAAA 


ATOM 


1094 


N 


GLN 


A 


139 


23. 


.323 


48. 


.095 


10. 


,175 


1. 
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A 


232 


-5 


.326 


53 


.262 


22 


. 222 


1 


.00 


39 


.61 


AAAA 


ATOM 


1783 


CB 


ASP 


A 


232 


-5 


.780 


54 


.674 


21 


.845 


1 


.00 


38 


.66 


AAAA 


ATOM 


1784 


CG 


ASP 


A 


232 


-7 


.148 


55 


.024 


22 


.426 


1 


.00 


38 


.60 


AAAA 


ATOM 


1785 


OD1 


ASP 


A 


232 


-7 


.712 


56 


.064 


22 


.043 


1 


.00 


40 


.26 


AAAA 


ATOM 


1786 


OD2 


ASP 


A 


232 


-7 


.666 


54 


.271 


23 


.270 


1 


.00 


37 


.24 


AAAA 


ATOM 


1787 


C 


ASP 


A 


232 


-6 


.018 


52 


.228 


21 


.345 


1 


.00 


39 


.88 


AAAA 


ATOM 


1788 


O 


ASP 


A 


232 


-6 


.382 


52 


.480 


20 


.197 


1 


.00 


38 


.47 


AAAA 


ATOM 


1789 


N 


GLU 


A 


233 


-6 


.179 


51 


.046 


21 


.920 


1 


.00 


41 


.63 


AAAA 


ATOM 


1790 


CA 


GLU 


A 


233 


-6 


.769 


49 


.915 


21 


.243 


1 


.00 


42 


. 44 


AAAA 


ATOM 


1791 


CB 


GLU 


A 


233 


-8 


.259 


50 


.128 


20 


.984 


1 


.00 


44 


.32 


AAAA 


ATOM 


1792 


CG 


GLU 


A 


233 


-8 


.995 


48 


.813 


20 


.813 


1 


.00 


48 


.29 


AAAA 


ATOM 


1793 


CD 


GLU 


A 


233 


-8 


.727 


47 


.835 


21 


.967 


1 


.00 


51 


.67 


AAAA 


ATOM 


1794 


OE1 


GLU 


A 


233 


-9 


.292 


48 


.022 


23 


.069 


1 


.00 


52 


.51 


AAAA 


ATOM 


1795 


OE2 


GLU 


A 


233 


-7 


.939 


46 


.882 


21 


.776 


1 


.00 


52 


. 98 


AAAA 
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ATOM 


1796 


C 


GLU 


A 


233 


-6, 


030 


49. 


694 


19. 


941 


1. 


00 


41. 


69 


AAAA 


ATOM 


17 97 


O 


GLU 


A 


233 


-4. 


978 


49. 


063 


19. 


924 


1. 


00 


41. 


89 


AAAA 


ATOM 


1798 


N 


ALA 


A 


234 


-6. 


544 


50. 


243 


18. 


855 


1. 


00 


41. 


27 


AAAA 


ATOM 


1799 


CA 


ALA 


A 


234 


-5. 


8 93 


50. 


032 


17. 


575 


1. 


00 


42. 


27 


. AAAA 


ATOM 


1800 


CB 


ALA 


A 


234 


-6. 


938 


49. 


572 


16. 


553 


1. 


00 


44. 


02 


AAAA 


ATOM 


1801 


C 


ALA 


A 


234 


-5. 


107 


51. 


224 


17. 


030 


1. 


00 


41. 


41 


AAAA 


ATOM 


1802 


O 


ALA 


A 


234 


-4 . 


851 


51. 


310 


15. 


830 


1. 


00 


41. 


98 


AAAA 


ATOM 


1803 


N 


THR 


A 


235 


-4. 


690 


52. 


128 


17. 


903 


1. 


00 


40. 


03 


AAAA 


ATOM 


1804 


CA 


THR 


A 


235 


-3. 


961 


53. 


299 


17. 


439 


1. 


00 


39.19 


AAAA 


ATOM 


1805 


CB 


THR 


A 


235 


-4. 


929 


54. 


471 


17. 


274 


1. 


00 


39. 


66 


AAAA 


ATOM 


1806 


OG1 


THR 


A 


235 


-5. 


953 


54. 


107 


16.342 


1. 


00 


39. 


44 


AAAA 


ATOM 


1807 


CG2 


THR 


A 


235 


-4. 


204 


55. 


709 


16.784 


1. 


00 


41. 


07 


AAAA 


ATOM 


1808 


C 


THR 


A 


235 


-2. 


871 


53. 


707 


18. 


410 


1. 


00 


37. 


83 


AAAA 


ATOM 


1809 


O 


THR 


A 


235 


-2. 


876 


53. 


273 


19. 


550 


1. 


00 


38. 


57 


AAAA 


ATOM 


1810 


N 


CYS 


A 


236 


-1. 


930 


54. 


533 


17. 


966 


1. 


00 


37. 


08 


AAAA 


ATOM 


1811 


CA 


CYS 


A 


236 


-0. 


878 


54. 


997 


18. 


866 


1. 


00 


36. 


69 


AAAA 


ATOM 


1812 


C 


CYS 


A 


236 


-1. 


179 


56. 


415 


19. 


320 


1. 


00 


36. 


04 


AAAA 


ATOM 


1813 


O 


CYS 


A 


236 


-1. 


659 


57. 


240 


18. 


542 


1. 


00 


35. 


63 


AAAA 


ATOM 


1814 


CB 


CYS 


A 


236 


0. 


505 


54. 


975 


18. 


197 


1. 


00 


36. 


79 


AAAA 


ATOM 


1815 


SG 


CYS 


A 


236 


1. 


131 


53. 


327 


17. 


732 


1. 


00 


38. 


41 


AAAA 


ATOM 


1816 


N 


LYS 


A 


237 


-0. 


898 


56. 


689 


20. 


587 


1. 


00 


35. 


48 


AAAA 


ATOM 


1817 


CA 


LYS 


A 


237 


-1. 


117 


58. 


014 


21. 


151 


1. 


00 


34. 


54 


AAAA 


ATOM 


1818 


CB 


LYS 


A 


237 




385 


58.018 


21. 


997 


1. 


00 


32. 


65 


AAAA 


ATOM 


1819 


CG 


LYS 


A 


237 


-3. 


607 


57, 


482 


21. 


297 


1. 


00 


29. 


86 


AAAA 


ATOM 


1820 


CD 


LYS 


A 


237 


-3. 


984 


58. 


340 


20. 


127- 


1. 


00 


2.7. 


86 


AAAA 


ATOM 


1821 


CE 


LYS 


A 


237 


-5. 


.368 


58. 


,006 


19. 


652 


1. 


00 


26. 


79 


AAAA 


ATOM 


1822 


NZ 


LYS 


A 


237 


-6. 


,343 


58. 


.216 


20. 


736 


1. 


00 


26. 


75 


AAAA 


ATOM 


1823 


C 


LYS 


A 


237 


0. 


.069 


58. 


.416 


22. 


025 


1. 


,00 


35. 


04 


AAAA 


ATOM 


1824 


O 


LYS 


A 


237 


0. 


,884 


57. 


,579 


22. 


406 


1. 


,00 


33. 


.72 


AAAA 


ATOM 


1825 


N 


ASP 


A 


238 


0. 


.158 


59. 


.704 


22. 


332 


1. 


00 


36. 


22 


AAAA 


ATOM 


1826 


CA 


ASP 


A 


238 


1. 


,223 


60. 


.234 


23. 


,178 


1. 


.00 


36. 


.75 


AAAA 


ATOM 


1827 


CB 


ASP 


A 


238 


1. 


.614 


61. 


.628 


22. 


,703 


1. 


.00 


37. 


.69 


AAAA 


ATOM 


1828 


CG 


ASP 


A 


238 


2. 


.618 


62. 


.294 


23. 


.618 


1. 


.00 


40. 


.60 


AAAA 


ATOM 


1829 


OD1 


ASP 


A 


238 


3. 


.720 


61. 


.729 


23. 


.805 


1. 


.00 


41. 


.68 


AAAA 


ATOM 


1830 


OD2 


ASP 


A 


238 


2. 


.305 


63, 


.387 


24. 


.150 


1. 


.00 


41. 


.14 


AAAA 


ATOM 


1831 


C 


ASP 


A 


238 


0. 


.672 


60, 


.319 


24. 


.594 


1. 


.00 


36. 


.57 


. AAAA 


ATOM 


1832 


O 


ASP 


A 


238 


1. 


.405 


60. 


.321 


25. 


.585 


1. 


.00 


36.18 


AAAA 


ATOM 


1833 


N 


THR 


A 


239 


-0. 


.646 


60. 


.367 


24. 


.667 


1. 


.00 


35. 


.86 


AAAA 


ATOM 


1834 


CA 


THR 


A 


239 


-1, 


.337 


60. 


.482 


25. 


.921 


1. 


.00 


37. 


.05 


AAAA 


ATOM 


1835 


CB 


THR 


A 


239 


-1, 


.310 


61. 


.954 


26. 


.404 


1. 


.00 


37. 


.76 


AAAA 


ATOM 


1836 


OG1 


THR 


A 


239 


-0, 


.077 


62. 


.202 


27. 


.094 


1, 


.00 


38. 


.43 


AAAA 


ATOM 


1837 


CG2 


THR 


A 


239 


-2. 


.489 


62. 


.263 


27. 


.315 


1. 


.00 


38. 


. 92 


AAAA 


ATOM 


1838 


C 


THR 


A 


239 




.754 


60. 


.045 


25. 


.648 


1, 


.00 


36, 


.84 


AAAA 


ATOM 


1839 


O 


THR 


A 


239 


-3. 


.394 


60. 


.559 


24. 


.730 


1, 


.00 


37. 


.41 


AAAA 


ATOM 


1840 


N 


CYS 


A 


2'4 0 - 


-3. 


.248 


59, 


.095 


26. 


.434 


1, 


.00 


36, 


,64- 


AAAA 


ATOM 


1841 


CA 


CYS 


A 


240 


-4 . 


.604 


58 


.622 


26. 


.224 


1, 


.00 


36, 


.25 


AAAA 


ATOM 


1842 


C 


CYS 


A 


240 


-5. 


.4 95 


59 


.827 


26. 


.291 


1. 


.00 


35. 


.05 


AAAA 


ATOM 


1843 


0 


CYS 


A 


240 


-5 


.182 


60 


.796 


26. 


. 969 


1. 


.00 


34. 


.52 


AAAA 


ATOM 


1844 


CB 


CYS 


A 


240 


-5 


.037 


57 


.632 


27. 


.307 


1. 


.00 


37, 


.17 


AAAA 


ATOM 


1845 


SG 


CYS 


A 


240 


-3.936 


56 


.207 


27. 


.526 


1. 


.00 


37, 


.97 


AAAA 


ATOM 


1846 


N 


PRO 


A 


241 


-6 


.603 


59 


.803 


25. 


.555 


1 


.00 


35 


.08 


AAAA 


ATOM 


1847 


CD 


PRO 


A 


241 


-7 


.008 


58 


.854 


24 


.505 


1 


.00 


35 


.22 


AAAA 


ATOM 


1848 


CA 


PRO 


A 


241 


-7 


.501 


60 


.956 


25 


.606 


1 


.00 


35 


.36 


AAAA 


ATOM 


1849 


CB 


PRO 


A 


241 


-8 


.54 9 


60 


.618 


24 


.540 


1 


.00 


34 


.35 


AAAA 


ATOM 


1850 


CG 


PRO 


A 


241 


-8 


.474 


59 


.118 


24 


.413 


1 


.00 


34 


.55 


AAAA 


ATOM 


1851 


C 


PRO 


A 


241 


-8 


.066 


61 


.101 


27 


.027 


1 


.00 


33 


.47 


AAAA 


ATOM 


1852 


0 


PRO 


A 


241 


-8 


.622 


60 


.164 


27 


.581 


1 


.00 


32 


.74 


PAPA 


ATOM 


1853 


N 


PRO 


A 


242 


-7 


.911 


62 


.286 


27 


.631 


1 


.00 


33 


.27 


AAAA 


ATOM 


1854 


CD 


PRO 


A 


242 


-7 


.326 


63 


.487 


27 


.021 


1 


.00 


34 


.97 


AAAA 


ATOM 


1855 


CA 


PRO 


A 


242 


-8 


.377 


62 


.590 


28 


.981 


1 


.00 


34 


.95 


AAAA 


ATOM 


1856 


CB 


PRO 


A 


242 


-8 


.038 


64 


.064 


29 


.140 


1 


.00 


35 


.00 


AAAA 


ATOM 


1857 


CG 


PRO 


A 


242 


-6 


.883 


64 


.248 


28 


.228 


1 


,00 


35 


.29 


AAAA 


ATOM 


1858 


C 


PRO 


A 


242 


-9 


.849 


62 


.343 


29 


.156 


1 


.00 


35 


.57 


AAAA 


ATOM 


1859 


O 


PRO 


A 


242 


-10 


.629 


62 


.559 


28. 


.229 


1 


.00 


35 


. 45 


AAAA 


ATOM 


1860 


N 


LEU 


A 


243 


-10 


.218 


61 


.897 


30 


.354 


1 


.00 


36 


.94 


AAAA 


ATOM 


1861 


CA 


LEU 


A 


243 


-11 


.609 


61 


.624 


30 


.682 


1 


.00 


37 


.91 


AAAA 


ATOM 


1862 


CB 


LEU 


A 


243 


-11 


.743 


61 


.163 


32 


.135 


1 


.00 


37 


.75 


AAAA 


ATOM 


1863 


CG 


LEU 


A 


243 


-11 


.280 


59.764 


32 


.555 


1 


.00 


37 


.72 


AAAA 


ATOM 


1864 


CD1 


LEU 


A 


243 


-11 


.605 


59 


.575 


34 


.023 


1 


.00 


36 


.74 


AAAA 


ATOM 


1865 


CD2 


LEU 


A 


243 


-11 


. 969 


58 


.689 


31 


.733 


1 


.00 


36 


.10 


AAAA 


ATOM 


1866 


C 


LEU 


A 


243 


-12 


.444 


62 


.880 


30 


.484 


1 


.00 


38 


.88 


AAAA 


ATOM 


1867 


O 


LEU 


A 


243 


-13 


.635 


62 


.801 


30 


.186 


1 


.00 


39 


.81 


AAAA 


ATOM 


1868 


N 


MET 


A 


244 


-11 


.815 


64 


.040 


30.657 


1 


.00 


38 


.61 


AAAA 


ATOM 


1869 


CA 


MET 


A 


244 


-12 


.504 


65 


.315 


30 


.493 


1 


.00 


37 


.67 


AAAA 


ATOM 


1870 


CB 


MET 


A 244 


-12,828 


65.922 


31 


.852 


1 


.00 


39 


.24 


AAAA 
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ATOM 


1871 


CG 


MET A 244 


-13.735 


ATOM 


1872 


SD 


MET A 24 4 


-15.168 


ATOM 


1873 


CE 


MET A 24 4 


-16.462 


ATOM 


1874 


C 


MET A 24 4 


-11.647 


ATOM 


1875 


0 


MET A 24 4 


-10.461 


ATOM 


1876 


N 


LEU A 245 


-12.253 


ATOM 


1877 


CA 


LEU A 245 


-11.548 


ATOM 


1878 


CB 


LEU A 245 


-11.767 


ATOM 


1879 


CG 


LEU A 245 


-10.543 


ATOM 


1880 


CD1 


LEU A 245 


-9.554 


ATOM 


1881 


CD2 


LEU A 245 


-10.982 


ATOM 


1882 


C 


LEU A 245 


-12.036 


ATOM 


1883 


o 


LEU A 245 


-13.239 


ATOM 


1884 


N 


TYR A 24 6 


-11.092 


ATOM 


1885 


CA 


TYR A 24 6 


-11.399 


ATOM 


1886 


CB 


TYR A 246 


-10.132 


ATOM 


1887 


CG 


TYR A 24 6 


-10.344 


ATOM 


1888 


CD1 


TYR A 24 6 


-11.026 


ATOM 


1889 


CE1 


TYR A 24 6 


-11.136 


ATOM 


1890 


CD2 


TYR A 24 6 


-9.791 


ATOM 


1891 


CE2 


TYR A 24 6 


-9.898 


ATOM 


1892 


CZ 


TYR A 24 6 


-10.566 


ATOM 


1893 


OH 


TYR A 24 6 


-10.633 


ATOM 


1894 


C 


TYR A 246 


-12.024 


ATOM 


1895 


O 


TYR A 246 


-11.565 


ATOM 


1896 


N 


ASN A 247 


-13.069 


ATOM 


1897 


CA 


ASN A 247 


-13.731 


ATOM 


1898 


CB 


ASN A 247 


-15.238 


ATOM 


1899 


CG 


ASN A 247 


-15.970 


ATOM 


1900 


OD1 


ASN A 247 


-15.626 


ATOM 


1901 


ND2 


ASN A 247 


-16.990 


ATOM 


1902 


C 


ASN A 247 


-13.243 


ATOM 


1903 


O 


ASN A 247 


-13.475 


ATOM 


1904 


N 


PRO A 24 8 


-12.560 


ATOM 


1905 


CD 


PRO A 248 


-12.191 


ATOM 


1906 


CA 


PRO A 24 8 


-12.006 


ATOM 


1907 


CB 


PRO A 24 8 


-11.124 


ATOM 


1908 


CG 


PRO A 24 8 


-10.870 


ATOM 


1909 


c 


PRO A 24 8 


-13.017 


ATOM 


1910 


O 


PRO A 24 8 


-12.714 


ATOM 


1911 


N 


THR A 24 9 


-14 .214 


ATOM 


1912 


CA 


THR A 24 9 


-15.248 


ATOM 


1913 


CB 


THR A 24 9 


-16.131 


ATOM 


1914 


OG1 


THR A 24 9 


-17.142 


ATOM 


1915 


CG2 


THR A 24 9 


-15.295 


ATOM 


1916 


C 


THR A 24 9 


-16.149 


ATOM 


1917 


O 


THR A 24 9 


-16.839 


ATOM 


1918 


N 


THR A 250 


-16.140 


ATOM 


1919 


CA 


THR A 250 


-16.996 


ATOM 


1920 


CB 


THR A 250 


-17.789 


ATOM 


1921 


OG1 


THR A 250 


-18.606 


ATOM 


1922 


CG2 


THR A 250 


-18.686 


ATOM 


1923 


C 


THR A 250 


-16.257 


ATOM 


1924 


O 


THR A 250 


-16.861 


ATOM 


1925 


N 


TYR A 251 


-14.949 


ATOM 


1926 


CA 


TYR A 251 


-14.166 


ATOM 


1927 


CB 


TYR A 251 


-14.231 


ATOM 


1928 


CG 


TYR A 251 


-13.629 


ATOM 


1929 


CD1 


. TYR A 251 


-14.351 


ATOM 


1930 


CE1 


. TYR A 251 


-13.765 


ATOM 


1931 


CD2 


! TYR A 251 


-12.313 


ATOM 


1932 


CE2 TYR A 251 


-11.719 


ATOM 


1933 


CZ 


TYR A 251 


-12.444 


ATOM 


1934 


OH 


TYR A 251 


-11.837 


ATOM 


1935 


C 


TYR A 251 


-14.745 


ATOM 


1936 


O 


TYR A 251 


-15.039 


ATOM 


1937 


N 


GLN A 252 


-14.933 


ATOM 


1938 


CA 


GLN A 252 


-15.452 


ATOM 


1939 


CB 


GLN A 252 


-16.946 


ATOM 


1940 


CG 


GLN A 252 


-17.832 


ATOM 


1941 


CD 


GLN A 252 


-17.663 


ATOM 


1942 


OE1 GLN A 252 


-17.677 


ATOM 


1943 


NE2 GLN A 252 


-17.504 


ATOM 


1944 


C 


GLN A 252 


-14.745 


ATOM 


1945 


O 


GLN A 252 


-13.975 
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65.066 32.686 1.00 41.36 AAAA 

65.991 33.205 1.00 45.46 AAAA 

65.345 32.080 1.00 44.42 AAAA 

66.280 29.697 1.00 35.85 AAAA 

66.039 29.496 1.00 35.41 AAAA 

67.375 29.255 1.00 33.97 AAAA 

68.367 28.457 1.00 32.66 AAAA 

68.074 26.973 1.00 30.77 AAAA 

67.835 26.095 1.00 29.40 AAAA 

66.940 26.798 1.00 29.86 AAAA 

67.200 24.803 1.00 29.46 AAAA 

69.773 28.776 1.00 33.23 AAAA 

70.004 28.919 1.00 32.27 AAAA 

70.709 28.875 1.00 32.69 AAAA 

72.108 29.181 1.00 30.97 AAAA 

72.831 29.586 1.00 29.20 AAAA 

74.253 30.040 1.00 27.33 AAAA 

74.538 31.231 1.00 25.88 AAAA 

75.856 31.700 1.00 23.69 AAAA 
75.315 29.325 1.00 25.84 AAAA 
76.625 29.785 1.00 24.41 AAAA 
76.885 30.971 1.00 23.70 AAAA 
78.170 31.424 1.00 24.81 AAAA 
72.872 28.025 1.00 30.75 AAAA 
72.793 26.892 1.00 31.52 AAAA 
73.627 28.320 1.00 30.78 AAAA 

74.413 27.301 1.00 31.14 AAAA 
74.339 27.485 1.00 32.19 AAAA 
75.131 26.452 1.00 35.34 AAAA 
76.282 26.181 1.00 36.08 AAAA 
74.531 25.861 1.00 37.73 AAAA 
75.846 27.488 1.00 30.59 AAAA 
76.440 28.529 1.00 30.73 AAAA 
76.415 26.480 1.00 30.11 AAAA 
75.736 25.234 1.00 29.14 AAAA 
77.777 26.487 1.00 31.18 AAAA 
77.805 25.234 1.00 30.32 AAAA 
76.374 24.941 1.00 29.86 AAAA 
78.913 26.461 1.00 31.26 AAAA 
80.034 26.853 1.00 31.72 AAAA 
78.618 25.981 1.00 31.91 AAAA 
79.621 25.883 1.00 33.02 AAAA 
79.381 24.643 1.00 34.21 AAAA 

78.414 24.951 1.00 35.70 AAAA 
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ATOM 3971 O LEU B 25 -17,573 106.416 19.271 1.00 26.52 BBBB 

ATOM 3972 N SER B 26 -19.788 106.098 18.980 1.00 27.04 BBBB 

ATOM 3973 CA SER B 26 -19.866 104.888 19.811 1.00 25.76 BBBB 

ATOM 3974 CB SER B 26 -21.257 104.280 19.706 1.00 25.62 BBBB 

ATOM 3975 OG SER B 26 -21.779 103.944 20.974 1.00 27.77 BBBB 

ATOM 3976 C SER B 26 -18.865 103.913 19.234 1.00 25.29 BBBB 

ATOM 3977 O SER B 26 -17.892 103.529 19.870 1.00 26.11 BBBB 

ATOM 3978 N LEU B 27 -19.128 103.536 17.994 1.00 26.18 BBBB 

ATOM 3979 CA LEU B 27 -18.297 102.628 17.224 1.00 26.87 BBBB 

ATOM 3980 CB LEU B 27 -18.749 102.713 15.764 1.00 24.98 BBBB 

ATOM 3981 CG LEU B 27 -18.011 102-112 14.571 1.00 24.04 BBBB 

ATOM 3982 CDl LEU B 27 -17.071 101.006 14.966 1.00 22.43 BBBB 

ATOM 3983 CD2 LEU B 27 -19.079 101.610 13.615 1.00 22.62 BBBB 

ATOM 3984 C LEU B 27 -16.805 102.920 17.347 1.00 26.99 BBBB 

ATOM 3985 O LEU B 27 -16.012 102.036 17.612 1.00 25.04 BBBB 

ATOM 3986 N GLN B 28 -16.428 104.175 17.169 1.00 29.66 BBBB 

ATOM 3987 CA GLN B 28 -15.020 104.553 17.231 1.00 32.69 BBBB 

ATOM 3988 CB GLN B 28 -14.842 106.025 16.880 1.00 33.16 BBBB 

ATOM 3989 CG GLN B 28 -13.396 106.441 16.825 1.00 32.72 BBBB 

ATOM 3990 CD GLN B 28 -13.234 107.908 16.546 1.00 33.54 BBBB 

ATOM 3991 OE1 GLN B 28 -12.139 108.367 16.248 1.00 34.63 BBBB 

ATOM 3992 NE2 GLN B 28 -14.323 108.660 16.649 1.00 31.86 BBBB 

ATOM 3993 C GLN B 28 -14.357 104.300 18.565 1.00 33.61 BBBB 

ATOM 3994 O GLN B 28 -13.223 103.843 18.620 1.00 33.71 BBBB 

ATOM 3995 N ARG B 29 -15.054 104.626 19.642 1.00 34.48 BBBB 

ATOM 3996 CA ARG B 29 -14.501 104.421 20.970 1.00 34.93 BBBB 

ATOM 3997 CB ARG B 29 -15.443 105.012 22.024 1.00 36.92 BBBB 

ATOM 3998 CG ARG B 29 -15.065 104.716 23.475 1.00 41.17 BBBB 

ATOM 3999 CD ARG B 29 -15.980 105.446 24.447 1.00 43.84 BBBB 

ATOM 4000 NE ARG B 29 -17.350 105.476 23.947 1.00 48.80 BBBB 

ATOM 4 001 CZ ARG B 29 -17.805 106.359 23.061 1.00 51.15 BBBB 

ATOM 4002 NH1 ARG B 29 -16.993 107.298 22.582 1.00 52.44 BBBB 

ATOM 4003 NH2 ARG B 29 -19.063 106.293 22.637 1.00 52.14 BBBB 

ATOM 4 004 C ARG B 29 -14.310 102.928 21.209 1.00 33.29 BBBB 

ATOM 4 005 O ARG B 29 -13.224 102.469 21.578 1.00 32.82 BBBB 

ATOM 4006 N MET B 30 -15.374 102.173 20.984 1.00 30.23 BBBB 

ATOM 4007 CA MET B 30 -15.321 100.746 21.189 1.00 28.48 BBBB 

ATOM 4008 CB MET B 30 -16.583 100.094 20.625 1.00 24.81 BBBB 

ATOM 4009 CG MET B 30 -16.562 98.570 20.609 1.00 18.87 BBBB 

ATOM 4010 SD MET B 30 -16.271 97.870 22.242 1.00 15.53 BBBB 

ATOM 4 011 CE MET B 30 -17.939 97.726 22.87 9 1.00 11.94 BBBB 

ATOM 4012 C MET B 30 -14.094 100.088 20.574 1.00 29.20 BBBB 

ATOM 4013 O MET B 30 -13.468 99.222 21.184 1.00 29.58 BBBB 

ATOM 4014 N PHE B 31 -13.720 100.511 19.378 1.00 29-35 BBBB 

ATOM 4015 CA-PHE-B 31 -12.611 99.853 18*. 718 1.00 30.40 BBBB 

ATOM 4016 CB PHE B 31 -13.059 99.417 17.319 1.00 27.67 BBBB 

ATOM 4017 CG PHE B 31 -14.189 98.413 17.320 1.00 24.90 BBBB 

ATOM 4018 CDl PHE B 31 -13.932 97.056 17.396 1.00 24.37 BBBB 

ATOM 4 019 CD2 PHE B 31 -15.505 98.830 17.219 1.00 22.85 BBBB 

ATOM 4020 CE1 PHE B 31 -14.967 96.140 17.365 1.00 24.21 BBBB 

ATOM 4 021 CE2 PHE B 31 -16.541 97.923 17.187 1.00 21.02 BBBB 

ATOM 4022 CZ PHE B 31 -16.280 96.584 17.256 1.00 22.95 BBBB 

ATOM 4 023 C PHE B 31 -11.276 100.571 18.627 1.00 32.28 BBBB 

ATOM 4024 O PHE B 31 -10.268 99.923 18.376 1.00 34.41 BBBB 

ATOM 4025 N ASN B 32 -11.249 101.885 18.826 1.00 33.84 BBBB 

ATOM 4026 CA ASN B 32 -9.995 102.621 18.726 1.00 35.06 BBBB 

ATOM 4027 CB ASN B 32 -10.119 104.011 19.368 1.00 38.78 BBBB 

ATOM 4 028 CG ASN B 32 -8.902 104.885 19.115 1.00 42.37 BBBB 

ATOM 4 029 OD1 ASN B 32 -7.937 104.435 18.512 1.00 4 4.20 BBBB 

ATOM 4030 ND2 ASN B 32 -8.942 106.132 19.573 1.00 48.31 BBBB 

ATOM 4031 C ASN B 32 -8.912 101.834 19.431 1.00 35.47 BBBB 

ATOM 4 032 O ASN B 32 -9.040 101.527 20.608 1.00 37.31 BBBB 

ATOM 4 033 N ASN B 33 -7.87 4 101.456 18.695 1.00 35.33 BBBB 

ATOM 4034 CA ASN B 33 -6.725 100.732 19.259 1.00 35.35 BBBB 

ATOM 4035 CB ASN B 33 -6.253 101.418 20.547 1.00 35.26 BBBB 

ATOM 4 036 CG ASN B 33 -5.714 102.7 99 20.305 1.00 36.95 BBBB 

ATOM 4037 OD1 ASN B 33 -4.744 102.988 19.559 1.00 37.71 BBBB 

ATOM 4038 ND2 ASN B 33 -6.336 103.785 20.940 1.00 36.44 BBBB 

ATOM 4039 C ASN B 33 -6.674 99.227 19.539 1.00 33.59 BBBB 

ATOM 4040 O ASN B 33 -5.584 98.728 19.793 1.00 34.79 BBBB 

ATOM 4041 N CYS B 34 -7.764 98.475 19.520 1.00 30.48 BBBB 

ATOM 4042 CA CYS B 34 -7.539 97.075 19.842 1.00 30.60 BBBB 

ATOM 4043 C CYS B 34 -6.983 96.191 18.767 1.00 30.29 BBBB 

ATOM 4044 O CYS B 34 -7.362 96.279 17.606 1.00 31.39 BBBB 

ATOM 4045 CB CYS B 34 -8.767 96.389 20.426 1.00 31.16 BBBB 
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ATOM 4121 0 THR B 44 -22.250 101.817 10.941 1.00 25.81 BBBB 

ATOM 4122 N TYR B 45 -23.944 101.353 9.512 1.00 24.59 BBBB 

ATOM 4123 CA TYR B 45 -24.347 102.716 9.190 1.00 23.28 BBBB 

ATOM 4124 CB TYR B 45 -25.405 103.175 10.180 1.00 22.33 BBBB 

ATOM 4125 CG TYR B 45 -26.551 102.210 10.311 1.00 23.06 BBBB 

ATOM 4126 CD1 TYR B 45 -26.393 101.012 10.986 1.00 23.27 BBBB 

ATOM 4127 CE1 TYR B 45 -27.449 100.123 11.117 1.00 26.08 BBBB 

ATOM 4128 CD2 TYR B 45 -27.797 102.499 9.765 1.00 22.19 BBBB 

ATOM 4129 CE2 TYR B 45 -28.860 101.623 9.887 1.00 22.61 BBBB 

ATOM 4130 CZ TYR B 45 -28.685 100.433 10.567 1.00 25.70 BBBB 

ATOM 4131 OH TYR B 45 -29.740 99.555 10.720 1.00 24.88 BBBB 

ATOM 4132 C TYR B 45 -23-267 103.792 9.036 1.00 23.29 BBBB 

ATOM 4133 O TYR B 45 -23.570 104.991 9.019 1.00 22.74 BBBB 

ATOM 4134 N VAL B 46 -22.010 103.384 8.914 1.00 23.44 BBBB 

ATOM 4135 CA VAL B 46 -20.964 104.372 8.717 1.00 24.86 BBBB 

ATOM 4136 CB VAL B 46 -19.551 103.725 8-714 1.00 24.27 BBBB 

ATOM 4137 CGI VAL B 46 -18.466 104.804 8.673 1.00 22.13 BBBB 

ATOM 4138 CG2 VAL B 46 -19.381 102.885 9.960 1.00 25.62 BBBB 

ATOM 4139 C VAL B 46 -21.275 105.050 7.374 1.00 25.94 BBBB 

ATOM 4140 O VAL B 46 -21.397 104.393 6.327 1.00 25.97 BBBB 

ATOM 4141 N GLN B 47 -21.439 106.369 7.430 1.00 26.63 BBBB 

ATOM 4142 CA GLN B 47 -21.762 107.170 6.259 1.00 27.06 BBBB 

ATOM 4143 CB GLN B 47 -22.633 108.349 6.665 1.00 23.99 BBBB 

ATOM 4144 CG GLN B 47 -23.942 107.941 7.275 1.00 24.28 BBBB 

ATOM 4145 CD GLN B 47 -25.024 107.721 6.245 1.00 26.17 BBBB 

ATOM 4146 OE1 GLN B 47 -25.494 106.603 6.064 1.00 25.67 BBBB 

ATOM 4147 NE2 GLN B 47 -25.431 108.794 5.563 1.00 25.63 BBBB 

ATOM 4148 C GLN B 47 -20.546 107.667 5.488 1.00 29.06 BBBB 

ATOM 4149 O GLN B 47 -19.412 107.683 5.997 1.00 27.81 BBBB 

ATOM 4150 N ARG B 4 8 -20.815 108.075 4.250 1.00 30.23 BBBB 

ATOM 4151 CA ARG B 48 -19.800 108.562 3.337 1.00 31.26 BBBB 

ATOM 4152 CB ARG B 48 -20.472 109.204 2.125 1.00 31.50 BBBB 

ATOM 4153 CG ARG B 48 -21.397 108.262 1.374 0.01 31.64 BBBB 

ATOM 4154 CD ARG B 48 -22.043 108.945 0.182 0.01 31.84 BBBB 

ATOM ' 4155 NE ARG B 4 8 -22.972 108.061 -0.516 0.01 31.95 BBBB 

ATOM 4156 CZ ARG B 48 -24.081 107.563 0.022 0.01 32.03 BBBB 

ATOM 4157 NH1 ARG B 48 -24.408 107.861 1.272 0.01 32.04 BBBB 

ATOM 4158 NH2 ARG B 48 -24.866 106.766 -0.689 0.01 32.01 BBBB 

ATOM 4159 C ARG B 48 -18.828 109.543 3,969 1.00 32.06 BBBB 

ATOM 4160 O ARG B 48 -19.180 110.283 4.887 1.00 31.63 BBBB 

ATOM 4161 N ASN B 4 9 -17.597 109.523 3.467 1.00 32.85 BBBB 

ATOM 4162 CA ASN B 49 -16.539 110.406 3.931 1.00 32.93 BBBB 

ATOM 4163 CB ASN B 49 -16.906 111.864 3.670 1.00 35.03 BBBB 

ATOM 4164 CG ASN B 4 9 -16.184 112.430 2.471 1.00 38.34 BBBB 

ATOM 4165 OD1 ASN B 49 -16.269 113.625 2.182 1.00 42.12 BBBB 

ATOM 4166 ND2 ASN B 49 -15.458 111.573 1.763 1.00 38.66 BBBB 

ATOM 4167 C ASN B 49 -16.133 110.260 5.377 1.00 31.90 BBBB 

ATOM 4168 O ASN B 49 -15.191 110.920 5.808 1.00 32.59 BBBB 

ATOM 4169 N TYR B 50 -16.840 109.433 6.139 1.00 30.16 BBBB 

ATOM 4170 CA TYR B 50 -16.453 109.246 7.523 1.00 28.87 BBBB 

ATOM 4171 CB TYR B 50 -17.556 108.586 8.325 1.00 30.11 BBBB 

ATOM 4172 CG TYR B 50 -18.487 109.566 8.980 1.00 32.04 BBBB 

ATOM 4173 CD1 TYR B 50 -19.395 110.305 8.225 1.00 33.37 BBBB 

ATOM 4174 CE1 TYR B 50 -20.278 111.198 8.831 1.00 33.21 BBBB 

ATOM 4175 CD2 TYR B 50 -18.476 109.746 10.365 1.00 32.49 BBBB 

ATOM 4176 CE2 TYR B 50 -19.355 110.638 10.980 1.00 33.80 BBBB 

ATOM 4177 CZ TYR B 50 -20.254 111.357 10.203 1.00 33.37 BBBB 

ATOM 4178 OH TYR B 50 -21.142 112.216 10.794 1.00 33.69 BBBB 

ATOM 4179 C TYR B 50 -15.205 108.387 7.553 1.00 28.22 BBBB 

ATOM 4180 O TYR B 50 -15.059 107.440 6.793 1.00 28.43 BBBB 

ATOM 4181 N ASP B 51 -14.299 108.729 8.447 1.00 27.78 BBBB 

ATOM 4182 CA ASP B 51 -13.047 108.026 8.550 1.00 27.08 BBBB 

ATOM 4183 CB ASP B 51 -11.976 108.996 9.002 1.00 28.01 BBBB 

ATOM 4184 CG ASP B 51 -10.764 108.299 9.516 1.00 30.24 BBBB 

ATOM 4185 OD1 ASP B 51 -10.873 107.645 10.569 1.00 30.54 BBBB 

ATOM 4186 OD2 ASP B 51 -9.705 108.399 8.865 1.00 32.87 BBBB 

ATOM 4187 C ASP B 51 -13.057 106.817 9.459 1.00 27.18 BBBB 

ATOM 4188 O ASP B 51 -13.405 106.910 10.628 1.00 27.55 BBBB 

ATOM 4189 N LEU B 52 -12.663 105.680 8.898 1.00 26.87 BBBB 

ATOM 4190 CA LEU B 52 -12.578 104.425 9.623 1.00 27.05 BBBB 

ATOM 4191 CB LEU B 52 -13.468 103.367 8.975 1.00 25.16 BBBB 

ATOM 4192 CG LEU B 52 -14.952 103.512 9.300 1.00 27.07 BBBB 

ATOM 4193 CD1 LEU B 52 -15.789 102.504 8.503 1.00 26.21 BBBB 

ATOM 4194 CD2 LEU B 52 -15.138 103.329 10.800 1.00 27.50 BBBB 

ATOM 4195 C LEU B 52 -11.110 104.007 9.563 1.00 27.79 BBBB 
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ATOM 6071 C ARG B 300 -9.652 77.389 45.167 1.00 75.20 BBBB 

ATOM 6072 0 ARG B 300 -9.587 77.323 43.947 1.00 76.11 BBBB 

ATOM 6073 N ALA B 301 -10.049 76.364 45.917 1.00 73.40 BBBB 

ATOM 6074 CA ALA B 301 -10.400 75.073 45.327 1.00 71.60 BBBB 

ATOM 6075 CB ALA B 301 -9.391 74.022 45.759 1.00 71.77 BBBB 

ATOM 6076 C ALA B 301 -11.806 74.600 45.665 1.00 70.39 BBBB 

ATOM 6077 O ALA B 301 -12.548 75.290 46.352 1.00 71.22 BBBB 

ATOM 6078 N CYS B 302 -12.153 73.408 45.186 1.00 69.63 BBBB 

ATOM 6079 CA CYS B 302 -13.475 72.817 45.410 1.00 69.81 BBBB 

ATOM 6080 C CYS B 302 -13.463 71.542 46.261 1.00 70.92 BBBB 

ATOM 6081 O CYS B 302 -12.996 70.495 45.818 1.00 70.09 BBBB 

ATOM 6082 CB CYS B 302 -14.121 72-460 44.077 1.00 68.40 BBBB 

ATOM 6083 SG CYS B 302 -14.256 73.797 42.864 1.00 65.62 BBBB 

ATOM 6084 N ALA B 303 -13.995 71.615 47.473 1.00 71.91 BBBB 

ATOM 6085 CA ALA B 303 -14.040 70.438 48.327 1.00 71.37 BBBB 

ATOM 6086 CB ALA B 303 -13.972 70.842 49.796 1.00 73.18 BBBB 

ATOM 6087 C ALA B 303 -15.346 69.714 48.028 1.00 71.30 BBBB 

ATOM 6088 O ALA B 303 -16.337 70.340 47.669 1.00 71.01 BBBB 

ATOM 6089 N ALA B 304 -15.342 68.395 48.160 1.00 71.88 BBBB 

ATOM 6090 CA ALA B 304 -16.532 67.606 47.884 1.00 71.97 BBBB 

ATOM 6091 CB ALA B 304 -16.223 66.127 48.032 1.00 71.72 BBBB 

ATOM 6092 C ALA B 304 -17.662 67.994 48.812 1.00 73.18 BBBB 

ATOM 6093 O ALA B 304 -17.469 68.109 50.024 1.00 72.92 BBBB 

ATOM 6094 N CYS B 305 -18.846 68.195 48.239 1.00 75.22 BBBB 

ATOM 6095 CA CYS B 305 -20.012 68.570 49.025 1.00 75.51 BBBB 

ATOM 6096 C CYS B 305 -20.337 67.431 49.998 1.00 77.37 BBBB 

ATOM 6097 O CYS B 305 -20.571 66.292 49.584 1.00 77.10 BBBB 

ATOM 6098 CB CYS B 305 -21.212 68.84 6 4 8.108 1.00 7 4.53 BBBB 

ATOM 6099 SG CYS B 305 -22.580 69.701 48.956 1.00 76.81 BBBB 

ATOM 6100 N ALA B 306 -20.340 67.742 51.291 1.00 78.67 BBBB 

ATOM 6101 CA ALA B 306 -20.623 66.747 52.314 1.00 79.35 BBBB 

ATOM 6102 CB ALA B 306 -19.727 66.981 53.523 1.00 78.45 BBBB 

ATOM 6103 C ALA B 306 -22.087 66.742 52.741 1.00 77.98 BBBB 

ATOM 6104 0 ALA B 306 -22.514 67.578 53.536 1.00 77.53 BBBB 

ATOM 6105 N GLY B 307 -22.845 65.789 52.209 1.00 77.39 BBBB 

ATOM 6106 CA GLY B 307 -24.249 65.674 52.552 1.00 77.35 BBBB 

ATOM 6107 C GLY B 307 -25.027 66.936 52.253 1.00 76.38 BBBB 

ATOM 6108 O GLY B 307 -24.936 67.475 51.152 1.00 77.04 BBBB 

ATOM 6109 N CYS B 309 -26.539 69.823 46.511 1.00 70.90 BBBB 

ATOM 6110 CA CYS B 309 -25.612 70.413 47.468 1.00 70.76 BBBB 

ATOM 6111 C CYS B 309 -26.262 71.632 48.147 1.00 68.92 BBBB 

ATOM 6112 O CYS B 309 -27.456 71.608 48.453 1.00 68.09 BBBB 

ATOM 6113 CB CYS B 309 -24.310 70.801 46.747 1.00 71.77 BBBB 

ATOM 6114 SG CYS B 309 -22.958 71.343 47.841 1.00 72.47 BBBB 

ATOM 6115 N ALA B~310 -25.480 72.687 4 8 . 37 6— -1-. 00-67 .-37 BBBB 

ATOM 6116 CA ALA B 310 -25.966 73.908 49.023 1.00 64.60 BBBB 

ATOM 6117 CB ALA B 310 -24.803 74.864 49.235 1.00 66.24 BBBB 

ATOM 6118 C ALA B 310 -27.070 74.608 48.236 1.00 62.95 BBBB 

ATOM 6119 O ALA B 310 -27.815 75.414 48.778 1.00 62.35 BBBB 

ATOM 6120 N LYS B 311 -27.152 7 4.286 4 6.952 1.00 61.61 BBBB 

ATOM 6121 CA LYS B 311 -28.124 74.845 46.015 1.00 60.10 BBBB 

ATOM 6122 CB LYS B 311 -29.468 75.168 46.672 1.00 59.83 BBBB 

ATOM 6123 CG LYS B 311 -30.533 75.564 45.636 1.00 59.97 BBBB 

ATOM 6124 CD LYS B 311 -31.895 75.830 46.250 1.00 59.88 BBBB 

ATOM 6125 CE LYS B 311 -33.015 75.633 45.233 1.00 58.79 BBBB 

ATOM 6126 NZ LYS B 311 -32.913 76.529 44.050 1.00 59. 99 BBBB 

ATOM 6127 C LYS B 311 -27.615 76.097 45.333 1.00 58.80 BBBB 

ATOM 6128 O LYS B 311 -28.041 77.207 45.640 1.00 58.77 BBBB 

ATOM 6129 N VAL B 312 -26.690 7 5.907 4 4.4 06 1.00 57.83 BBBB 

ATOM 6130 CA VAL B 312 -26.149 77.020 43.659 1.00 56.11 BBBB 

ATOM 6131 CB VAL B 312 -24.741 76.696 43.146 1.00 55.53 BBBB 

ATOM 6132 CGI VAL B 312 -24.079 77.949 42.600 1.00 55.22 BBBB 
ATOM 6133 CG2 VAL B 312 -23.919 7 6.105 4 4.27 6 1.00 55.51 BBBB 

ATOM 6134 C VAL B 312 -27.119 77.184 42.499 1.00 54.69 BBBB 

ATOM 6135 0 VAL B 312 -27.775 76.221 42.111 1.00 54.83 BBBB 

ATOM 6136 N CYS B 313 -27.232 78.396 41.964 1.00 54.15 BBBB 

ATOM 6137 CA CYS B 313 -28.146 78.653 40.855 1.00 52.77 BBBB 

ATOM 6138 C CYS B 313 -27.471 7 9.328 39.667 1.00 51.87 BBBB 

ATOM 6139 O CYS B 313 -26.643 80.227 39.834 1.00 50.44 BBBB 

ATOM 6140 CB CYS B 313 -29.333 79.496 41.339 1.00 52.69 BBBB 

ATOM 6141 SG CYS B 313 -30.388 78.618 42.543 1.00 54.18 BBBB 

ATOM 6142 N ASN B 314 -27.824 78.873 38.466 1.00 50.56 BBBB 

ATOM 6143 CA ASN B 314 -27.269 79.429 37.243 1.00 49.32 BBBB 

ATOM 6144 CB ASN B 314 -27.718 78.629 36.034 1.00 4 9.72 BBBB 

ATOM 6145 CG ASN B 314 -27.279 77.196 36.102 1.00 50.43 BBBB 
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ATOM 6221 N LEU B 325 -33.046 88.375 28.635 1.00 32.45 BBBB 

ATOM 6222 CA LEU B 325 -32.584 89.691 29.051 1.00 31.40 BBBB 

ATOM 6223 CB LEU B 325 -33.396 90.756 28.315 1.00 27.98 BBBB 

ATOM 6224 CG LEU B 325 -33.169 90.698 26.805 1.00 24.69 BBBB 

ATOM 6225 GDI LEU B 325 -33.929 91.799 26.131 1.00 24.32 BBBB 

ATOM 6226 CD2 LEU B 325 -31.696 90.841 26.507 1.00 24.02 BBBB 

ATOM 6227 C LEU B 325 -32.539 89.994 30.556 1.00 32.78 BBBB 

ATOM 6228 0 LEU B 325 -31.683 90.7 61 31.010 1.00 32.35 BBBB 

ATOM 6229 N SER B 326 -33.431 89.37 9 31.325 1.00 33.41 BBBB 

ATOM 6230 CA SER B 326 -33.491 89.614 32.766 1.00 34.34 BBBB 

ATOM 6231 CB SER B 326 -34.213 90.94 8 33.026 1.00 33.53 BBBB 

ATOM 6232 OG SER B 326 -34.685 91.059 34.360 1.00 33.60 BBBB 

ATOM 6233 C SER B 326 -34.236 88.468 33.464 1.00 36.35 BBBB 

ATOM 6234 O SER B 326 -34.681 87.516 32.807 1.00 37.25 BBBB 

ATOM 6235 N ILE B 327 -34.342 88.535 34.7 90 1.00 35.67 BBBB 

ATOM 6236 CA ILE B 327 -35.081 87.515 35.506 1.00 36.61 BBBB 

ATOM 6237 CB ILE B 327 -34.652 87.370 36.967 1.00 35.84 BBBB 

ATOM 6238 CG2 ILE B 327 -35.715 86.612 37.729 1.00 35.55 BBBB 

ATOM 6239 CGI ILE B 327 -33.339 86.593 37.053 1.00 36.06 BBBB 

ATOM 6240 CD1 ILE B 327 -32.893 86.259 38.475 1.00 35.14 BBBB 

ATOM 6241 C ILE B 327 -36.502 88.017 35.469 1.00 37.99 BBBB 

ATOM 6242 0 ILE B 327 -36.780 89.099 35.962 1.00 38.06 BBBB 

ATOM 6243 N ASN B 328 -37.389 87.225 34.876 1.00 40.08 BBBB 

ATOM 6244 CA ASN B 328 -38.791 87.584 34.733 1.00 41.54 BBBB 

ATOM 6245 CB ASN B 328 -39.155 87.640 33.246 1.00 41.43 BBBB 

ATOM 6246 CG ASN B 328 -38.835 86.340 32.506 1.00 44.08 BBBB 

ATOM 6247 OD1 ASN B 328 -39.327 85.268 32.874 1.00 46.44 BBBB 

ATOM 6248 ND2 ASN B 328 -38.012 86.445 31.460 1.00 43.98 BBBB 

ATOM 6249 C ASN B 328 -39.773 86.660 35.456 1.00 43.76 BBBB 

ATOM 6250 O ASN B 328 -39.383 85.822 36.274 1.00 43.95 BBBB 

ATOM 6251 N ALA B 329 -41.053 86.837 35.122 1.00 45.84 BBBB 

ATOM 6252 CA ALA B 329 -42.176 86.088 35.680 1.00 46.10 BBBB 

ATOM 6253 CB ALA B 329 -43.452 86.458 34.956 1.00 46.38 BBBB 

ATOM 6254 C ALA B 329 -41.971 84.601 35.580 1.00 46.98 BBBB 

ATOM 6255 0 ALA B 329 -42.801 83.817 36.030 1.00 48.20 BBBB 

ATOM 6256 N THR B 330 -40.874 84.208 34.963 1.00 46.89 BBBB 

ATOM 6257 CA THR B 330 -40.584 82.805 34.831 1.00 47.25 BBBB 

ATOM 6258 CB THR B 330 -40.210 82.500 33.397 1.00 45.36 BBBB 

ATOM 6259 OGl THR B 330 -41.147 83.147 32.532 1.00 43.34 BBBB 

ATOM 6260 CG2 THR B 330 -40.269 81.012 33.148 1.00 46.06 BBBB 

ATOM 6261 C THR B 330 -39.443 82.451 35.787 1.00 49.13 BBBB 

ATOM 6262 0 THR B 330 -39.676 82.028 36.914 1.00 48.65 BBBB 

ATOM 6263 N ASN B 331 -38.215 82.641 35.328 1.00 50.75 BBBB 

ATOM 6264 CA ASN B 331 -37.018 82.364 36.110 1.00 52.60 BBBB 

ATOM 6265 CB ASN B 331 -36.017 83.479 "35;874 1.00 52.-98 - BBBB 

ATOM 6266 CG ASN B 331 -35.841 83.779 34.421 1.00 52.98 BBBB 

ATOM 6267 OD1 ASN B 331 -36.813 83.934 33.686 1.00 51.88 BBBB 

ATOM 6268 ND2 ASN B 331 -34.597 83.869 33.990 1.00 54.62 BBBB 

ATOM 6269 C ASN B 331 -37.223 82.194 37.615 l.OO 53.93 BBBB 

ATOM 6270 O ASN B 331 -36.764 81.214 38.193 1.00 54.41 BBBB 

ATOM 6271 N ILE B 332 -37.885 83.154 38.256 1.00 54.69 BBBB 

ATOM 6272 CA ILE B 332 -38.111 83.056 39.691 1.00 55.47 BBBB 

ATOM 6273 CB ILE B 332 -38.988 84.224 40.211 1.00 55.06 BBBB 

ATOM 6274 CG2 ILE B 332 -38.106 85.413 40.573 1.00 53.92 BBBB 

ATOM 6275 CGI ILE B 332 -40.038 84.607 39.169 1.00 54.95 BBBB 

ATOM 6276 CD1 ILE B 332 -41.135 83.592 38.983 1.00 55.74 BBBB 

ATOM 6277 C ILE B 332 -38.763 81.715 40.027 1.00 57.18 BBBB 

ATOM 6278 O ILE B 332 -38.596 81.185 41.128 1.0057.94 BBBB 

ATOM 6279 N LYS B 333 -39.493 81.159 39.064 1.00 57.71 BBBB 

ATOM 6280 CA LYS B 333 -40.147 79.870 39.243 1.00 57.58 BBBB 

ATOM 6281 CB LYS B 333 -41.103 7 9.588 38.076 1.00 57.74 BBBB 

ATOM 6282 CG LYS B 333 -41.860 78.264 38.165 1.00 57.48 BBBB 

ATOM 6283 CD LYS B 333 -42.754 78.060 36.953 0.01 57.55 BBBB 

ATOM 6284 CE LYS B 333 -43.472 76.723 37.014 0.01 57.56 BBBB 

ATOM 6285 NZ LYS B 333 -44.321 76.603 38.231 0.01 57.58 BBBB 

ATOM 6286 C LYS B 333 -39.040 78.827 39.275 1.00 57.62 BBBB 

ATOM 6287 O LYS B 333 -39.287 77.628 39.149 1.00 59.01 BBBB 

ATOM 6288 N HIS B 334 -37.808 79.301 39.430 1.00 56.75 BBBB 

ATOM 6289 CA HIS B 334 -36.652 78.425 39.482 1-00 56.26 BBBB 

ATOM 6290 CB HIS B 334 -36.354 77.843 38.087 1.00 56.94 BBBB 

ATOM 6291 CG HIS B 334 -35.694 7 6.4 97 38.125 1.00 59.29 BBBB 

ATOM 6292 CD2 HIS B 334 -34.518 76.064 37.612 1.00 59.60 BBBB 

ATOM 6293 ND1 HIS B 334 -36.240 75.419 38.793 l.OO 60.20 BBBB 

ATOM 6294 CE1 HIS B 334 -35.428 74.383 38.692 1.00 60.01 BBBB 

ATOM 6295 NE2 HIS B 334 -34.375 74.747 37.981 1.00 60.38 BBBB 
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90.772 40.122 1.00 47.39 BBBB 

91.047 41.406 1.00 49.17 BBBB 
89.767 41.986 1.00 51.17 BBBB 
90.004 43.279 1.00 52.46 BBBB 
90.730 44.170 1.00 53.72 BBBB 
89.443 43.406 1.00 51.97 BBBB 
91.496 42.261 1.00 50.11 BBBB 
90.684 42.735 1.00 49.11 BBBB 
92.806 42.463 1.00 51.29 BBBB 
93.825 42.161 1.00 51.66 BBBB 
93.374 43.260 1.00 52.10 BBBB 
94.710 43.691 1.00 53.12 BBBB 
95.102 42.476 1.00 52.79 BEBB 
92.508 44.427 1.00 52.44 BBBB 
92.359 44.649 .1.00 52.61 BBBB 
91.931 45.169 1.00 53.29 BBBB 
91.081 46.295 1.00 54.32 BBBB 
90.646 47.127 1.00 54.37 BBBB 
89.771 48.302 1.00 56.94 BBBB 
89.268 49.162 1.00 58.98 BBBB 

90.048 49.625 1.00 59.81 BBBB 
87.958 49.396 1.00 58.67 BBBB 
89.840 45.764 1.00 54.08 BBBB 
89.423 46.276 1.00 54.71 BBBB 
89.262 44.725 1.00 53.57 BBBB 
88.064 44.088 1.00 53.03 BBBB 
87.930 42.690 1.00 55.4 9 BBBB 
86.505 42.156 1.00 60.15 BBBB 
86.402 40.843 1.00 62.85 BBBB 
86.752 40.814 1.00 62.84 BBBB 
85.966 39.837 1.00 65.22 BBBB 
88.080 43.993 1.00 51.01 BBBB 
87.030 43.999 1.00 50.24 BBBB 
89.273 43.928 1.00 49.14 BBBB 
89.411 43.814 1.00 48.22 BBBB 
90.698 43.052 1.00 47.13 BBBB 
90.663 42.075 1.00 46.22 BBBB 
91.134 42.746 1.00 45.00 BBBB 
89.250 -41.523 1.00 45.34 BBBB 
89.380 45.155 1.00 48.47 BBBB 

89.927 45.286 1.00 48.52 BBBB 
88.751 46.154 1.00 48.02 BBBB 
88.620 47.463 1.00 47.49 BBBB 

88.776 48.563 1.00 48.63 BBBB 
90.214 48.-809— 1-.-00-50-. -90 - BBBB 

90.928 49.354 1.00 51.96 BBBB 
90.635 48.450 1.00 51.44 BBBB 
87.239 47.514 1.00 47.18 BBBB 
86.904 48.417 1.00 46.76 BBBB 
86.438 46.513 1.00 46.08 BBBB 
85.100 46.390 1.00 44.33 BBBB 
84.545 45.014 1.00 44.66 BBBB 
83.199 44.814 1.00 45.04 BBBB 
84.471 44.870 1.00 45.30 BBBB 
84.190 43.469 1.00 44.16 BBBB 
85.238 46.535 1.00 44.15 BBBB 
84.354 47.055 1.00 46.18 BBBB 
86.37 4 46.088 1.00 42.97 BBBB 
86.640 46.157 1.00 41.06 BBBB 
87.858 45.297 1.00 39.49 BBBB 
87.716 43.774 1.00 38.64 BBBB 
87.176 43.170 1.00 38.53 BBBB 
89.069 43.190 1.00 37.33 BBBB 
86.877 47.575 1.00 40.15 BBBB 

86.777 47.843 1.00 39.93 BBBB 
87.173 48.477 1.00 39.59 BBBB 
87.468 49.881 1.00 40.17 BBBB 
87.449 50.747 1.00 40.88 BBBB 

88.643 50.555 1.00 40.72 BBBB 
89.282 51.885 1.00 40.02 BBBB 
89.718 52.644 1.00 39.28 BBBB 
90.555 53.826 1.00 39.65 BBBB 

86.644 50.603 1.00 39.31 BBBB 
87.140 51.552 -1.00 40.48 BBBB 
85.409 50.181 1.00 37.60 BBBB 
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34.479 

34.582 

36.025 

36.643 

36.313 

36.608 

36.218 

36.443 

37.911 

38.262 

37.541 

39.380 

36.047 

36.579 

35.100 

34.633 

34.379 

34.035 

33. 867 

35.165 

34.979 

33.352 

33.373 

32.241 

30. 949 

30.016 

29.647 

30.732 

30.284 

30.576 

29.391 

28.702 

27.511 

27.156 

26.885 

25.725 

25.218 

24.100 

22.888 

23.757 

26.113 

26.986 

25.444 

-25-7-4-8- 

25.772 

26.421 

26.811 

25.596 

24.678 

24.773 

25.188 

23.479 

22.507 

21.794 

21.287 

21.196 

22.241 

20.080 

21.512 

21.019 

21.250 

20.331 

21.149 

20.478 

19.678 

21.561 

19.495 

19.631 

18.615 

18.319 

17.746 

17.169 

17.005 

16.665 



28.082 
29.168 
26.979 
26.995 
25.712 
27.132 
26.143 
28.364 
28.684 
29.072 
30.438 
31.420 
30.510 
27.543 
27. 403 
26.733 
25.605 
24.409 
23.105 
21. 957 
21.677 
20.675 
25.998 
26.347 
25.957 
26.289 
25.046 
24.700 
23.862 
27.435 
27.679 
28.139 
29.258 
29.739 
29.150 
30.808 
31.388 
32.590 
33.371 
32.479 
34.564 
31.807 
32.646 
31.214 
■31-7-4 99-- 
30.184 
30.262 
28.872 
27. 971 
28.531 
32.485 
33.490 
32.219 
33.108 
32.359 
30.991 
29.993 
29.706 
29.482 
33.728 
33.061 
35.021 
35.773 
36.680 
37.526 
36.651 
38.293 
36.604 
37.814 
35.943 
34.504 
36.558 
35.342 
34.368 
37.532 



25.349 
25.524 
25.016 
24.808 
25.352 
23.312 
22.638 
22.810 
21.395 
21.151 
21.726 
21.518 
22.445 
20.460 
19.353 
20.910 
20.130 
21.053 
20.337 
21.334 
22.087 
23.174 
19. 408 
18. 223 
20.136 
19.564 
19.562 
20.904 
18.938 
20.315 
21.485 
19.628 
20.242 
19. 438 
18.415 
19.916 
19.255 
20.038 
19.364 
19.160 
20.220 
17.851 
17.652 
16.872 
-15.-4-87- 
14.709 
13.353 
12.860 
12.639 
11.610 
14.855 
14.286 
14.968 
14.362 
13.225 
13.626 
12.464 
11.831 
12.199 
15.352 
16.255 
15.183 
16.038 
16.971 
18.061 
19.026 
18,799 
15.063 
14. 987 
14.297 
14.426 
13.293 
12.582 
13.703 
13.757 



1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 



00 
00 
00 
00 
00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
-1,09- 
1.00 
1.00 
1.00 
1.00 
0.01 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 



71,94 
73.10 
74.58 
76.85 
76.89 
78.13 
77.63 
80.13 
81.57 
83.37 
85.20 
86.49 
84.95 
81.46 
82.36 
80.04 
78.24 
80.51 
82.67 
84.52 
84.21 
84.79 
75.68 
75.44 
71.44 
66.74 
67.32 
67.36 
66.70 
62.38 
61.07 
58.51 
52.88 
4 9.01 
48.74 
46.00 
41.63 
39.78 
38.12 
38.86 
38.65 
41.81 
38.42 
43.34 
-43 .-75 - 
45.59 
49.19 
54.32 
56.21 
55.21 
42.98 
43.41 
40.78 
41.64 
42.58 
47.93 
50.42 
51.30 
51.75 
40.17 
41.50 
37.15 
35.88 
33.00 
31.16 
27.07 
27.37 
35.67 
34.90 
35.91 
34.45 
34.74 
34.47 
33.10 
35.58 
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AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
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AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
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AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
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ATOM 
ATOM 
ATOM 
ATOM 
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ATOM 
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ATOM 
ATOM 
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ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
" " ATOM * 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 



896 
897 
898 
899 
900 
901 
902 
903 
904 
905 
906 
907 
908 
909 
910 
911 
912 
913 
914 
915 
916 
917 
918 
919 
920 
921 
922 
923 
924 
925 
926 
927 
928 
929 
930 
931 
932 
933 
934 
935 
936 
937 
938 
939 
940 
941 
942 
943 
944 
945 
946 
947 
948 
949 
950 
951 
952 
953 
954 
955 
956 
957 
958 
959 
960 
961 
962 
963 
964 
965 
966 
967 
968 
969 
97 0 



PRO 
MET 



CA MET 

CB MET 

CG MET 

SD MET 

CE MET 

C MET 



MET 
ARG 



CA ARG 
CB ARG 
CG ARG 
CD ARG 
NE ARG 
CZ ARG 
NH1 ARG 
NH2 ARG 
C ARG 
O ARG 
N ASN 
CA ASN 
CB ASN 
CG ASN 
OD1 ASN 
ND2 ASN 
C ASN 



ASN 
LEU 



CA LEU 
CB LEU 



CG LEU 
CD1 LEU 
CD2 LEU 
C LEU 
O LEU 
N GLN 
CA GLN 
CB GLN 
CG GLN 
CD GLN 
OE1 GLN 
NE2 GLN 
C GLN 
O "GLN 
N GLU 
CA GLU 
CB GLU 
CG GLU 
CD GLU 
OE1 GLU 
OE2 GLU 
C GLU 



GLU 
ILE 



CA ILE 
CB ILE 
CG2 ILE 
CGI ILE 
CD1 ILE 
C ILE 



ILE 
LEU 



CA LEU 
CB LEU 
CG LEU 
CD1 LEU 
CD2 LEU 
C LEU 
O LEU 
N HIS 
CA HIS 
CB HIS 
CG HIS 
CD2 HIS 



112 
113 
113 
113 
113 
113 
113 
113 
113 
114 
114 
114 
114 
114 
114 
114 
114 
114 
114 
114 
115 
115 
115 
115 
115 
115 
115 
115 
116 
116 
116 
116 
116 
116 
116 
116 
117 
117 
117 
117 
117 
117 
117 
117 
A-l'1-7- 
A 118 
118 
118 
118 
118 
118 
118 
118 
118 
119 
119 
119 
119 
119 
119 
119 
119 
120 
120 
120 
120 
120 
120 
120 
120 
121 
121 
121 
121 
121 



A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 



A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 



15.474 
17.067 
16.088 
16.625 
16. 941 
17.417 
18.929 
15.740 
15.867 
15.275 
14.925 
14.659 
13.397 
13.643 
12.440 
12.416 
13.534 
11.281 
13.749 
13.365 
13.178 
12.066 
10.797 
10.284 
9.645 
10.549 
12.384 
11.530 
13.617 
13.991 
15.260 
15.627 
14.439 
16.853 
14.215 
15.335 
13.134 
13.224 
11.978 
12.150 
11.135 
10.084 
11.431 
13.465 
- - 1-3-. 963- 
13.148 
13.370 
12.171 
12.463 
11.484 
11.688 
10.518 
13.728 
13.185 
14 .668 
15.132 
16.615 
17.114 
16.769 
18.197 
14.961 
15.819 
13.831 
13.477 
12.142 
10.950 
9.685 
10.763 
14.528 
14.895 
15.010 
16.032 
15.383 
14.777 
13.486 



37.296 
38.641 
39.616 
40.336 
39.399 
40.236 
41.042 
40.614 
41.831 
40.070 
40.848 
39.907 
39.131 
37.693 
36.888 
35.575 
34.935 
34. 906 
41.810 
42.429 
41.935 
42.852 
42.073 
41.289 
40.251 
41.786 
43.812 
44.577 
43.766 
44.646 
44.159 
44.944 
44.966 
44.311 
46.004 
46.367 
46.761 
48.077 
48.367 
48.039 
47.037 
46.860 
46.386 
49.198 
-50.-254 
48. 985 
50.071 
51.021 
52.382 
53. 447 
54.617 
53.125 
49.786 
48.893 
50.586 
50.547 
50.131 
50.286 



48. 
48. 



681 
207 
51.987 
52.845 
52.235 
53.553 
53.481 
53.124 
52.961 
54.212 
54.186 
55.341 
53.445 
53.973 
54.464 
55.835 
56.245 



13.531 
14.369 
14.848 
16.095 
17.268 
18.806 
18.312 
13.738 
13.902 
12.615 
11.424 
10.259 
10.427 
10.108 
10.258 
10.079 
9.751 
10.208 
11.542 
10.552 
12.735 
12.953 
13.280 
12.105 
12.269 
10.905 
14.095 
14.538 
14.581 
15.667 
16.357 
17.616 
18.575 
18.265 
15.058 
14.684 
14.955 
14.367 
13.542 
12.064 
11.580 
12.193 
10.467 
15.353 
14.-967 
16.626 
17.569 
17.523 
18.129 
17.692 
18.082 
16. 963 
19.032 
19.688 
19.518 
20. 892 
20. 980 
22.407 
20.519 
20.467 
21.382 
21.148 
22.041 
22.559 
23.302 



1.00 37.27 
1.00 34.99 
1.00 36.65 
1.00 34.70 



1.00 33.7 4 
1.00 36.52 
1.00 30.64 
1.00 38.06 
1.00 37.27 
1.00 39.25 
00 40.45 
00 39.26 
00 39.83 
00 40.56 
00 40.35 
00 41.43 
,00 40.24 
.00 41.08 
,00 40.83 
.00 42.04 
.00 40.74 
1.00 40.11 
1.00 40-28 
1.00 39.74 
1.00 40.28 
1.00 41.06 
1.00 41.55 
1.00 39.96 
1.00 43.33 
1.00 44.32 
1.00 4 4.21 
00 44.20 
00 42.58 
00 45.12 
00 46.03 
00 45.09 
00 48.45 
00 51.34 
1.00 51.64 
1.00 53.38 
1.00 54.82 
1.00 57.17 
1.00 55.76 
1.00 52.63 
1. -00-52-61 . 
1.00 53.82 
1.00 55.04 
1.00 57.91 
1.00 61.74 
1.00 64 .10 
1.00 68.14 



1.00 64 
1.00 



.36 



53.78 
1.00 52.95 
1.00 52.65 
1.00 52.4 9 
1.00 52.29 
1.00 52.17 
1.00 51.46 
1.00 49.83 
1.00 51.82 
1.00 50.57 
1.00 52.16 
1.00 52.96 



1.00 



22.403 
23.236 
21.346 
23.450 
23.245 
24.441 
25.340 
26.640 
26.530 
26.554 



52.52 
1.00 55.17 
1.00 54.68 
1.00 55.00 
1.00 53.51 
1.00 54.88 
1.00 54.15 
1.00 53.74 
1.00 55.83 
1.00 59.19 
1.00 58.69 



AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
. AAAA. 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
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26 


.565 


1 


.00 


54 


.88 


AAAA 


ATOM 


1858 


C 


CYS 


A 


240 


-6 


.585 


59 


.537 


26 


.552 


1 


.00 


56 


.05 


AAAA 


ATOM 


1859 


O 


CYS 


A 


240 


-6 


.399 


60 


.578 


27 


.194 


1 


.00 


57 


.09 


AAAA 


ATOM 


1860 


CB 


CYS 


A 


240 


-5 


.912 


57 


.434 


27 


.695 


1 


.00 


54 


.61 


AAAA 


ATOM 


1861 


SG 


CYS 


A 


240 


-4 


.946 


55 


.895 


27 


.726 


1 


.00 


54 


.55 


AAAA 


ATOM 


1862 


N 


PRO 


A 


241 


-7 


.681 


59 


.353 


25 


.805 


1 


.00 


54 


.66 


AAAA 


ATOM 


1863 


CD 


PRO 


A 


241 


-7 


.979 


58 


. 227 


24 


.911 


1 


.00 


54 


.86 


AAAA 


ATOM 


1864 


CA 


PRO 


A 


241 


-8 


.730 


60 


.371 


25 


.731 


1 


.00 


54 


.01 


AAAA 


ATOM 


1865 


CB 


PRO 


A 


241 


-9 


.735 


59 


.760 


24 


.753 


1 


.00 


53 


.92 


AAAA 


ATOM 


1866 


CG 


PRO 


A 


241 


-9 


.474 


58 


.282 


24 


.855 


1 


.00 


55 


.64 


AAAA 


ATOM 


1867 


C 


PRO A 


241 


-9.331 


60 


.626 


27 


.111 


1 


.00 


52 


.89 


AAAA 


ATOM 


1868 


O 


PRO A 


241 


-9.867 


59.713 


27 


.734 


1 


.00 


53 


.84 


AAAA 


ATOM 


1869 


N 


PRO A 


242 


-9 


.255 


61 


.877 


27 


.600 


1 


.00 


51 


.54 


AAAA 


ATOM 


1870 


CD 


PRO 


A 


242 


-8.726 


63.060 


26 


.906 


1 


.00 


49 


.64 


AAAA 
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ATOM 


1871 


CA 


PRO A 242 


-9. 


785 


62. 


253 


28. 


912 


1. 


00 


49. 


70 


AAAA 


ATOM 


1872 


CB 


PRO A 242 


-9. 


528 


63. 


754 


28. 


973 


1. 


00 


48. 


28 


AAAA 


ATOM 


1873 


CG 


PRO A 24 2 


-8. 


382 


63. 


948 


28. 


059 


1. 


00 


49. 


50 


AAAA 


ATOM 


1874 


C 


PRO A 242 


-11. 


258 


61. 


940 


29. 


050 


1. 


00 


51. 


05 


AAAA 


ATOM 


1875 


O 


PRO A 242 


-11. 


97 8 


61. 


849 


28. 


059 


1. 


00 


50. 


69 


AAAA 


ATOM 


1876 


N 


LEU A 243 


-11. 


701 


61. 


782 


30. 


2 92 


1. 


00 


52. 


78 


AAAA 


ATOM 


1877 


CA 


LEU A 243 


-13. 


101 


61. 


502 


30. 


57 9 


1. 


00 


53. 


84 


AAAA 


ATOM 


1878 


CB 


LEU A 243 


-13. 


241 


60. 


867 


31. 


963 


1. 


00 


55. 


15 


AAAA 


ATOM 


1879 


CG 


LEU A 243 


-12. 


465 


59. 


583 


32. 


237 


1. 


00 


57. 


26 


AAAA 


ATOM 


1880 


CD1 


LEU A 243 


-12. 


760 


59. 


142 


33. 


657 


1. 


00 


58. 


83 


AAAA 


ATOM 


1881 


CD2 


LEU A 243 


-12. 


863 


58. 


497 


31. 


245 


1. 


00 


57. 


07 


AAAA 


ATOM 


1882 


C 


LEU A 243 


-13. 


888 


62. 


807 


30. 


551 


1. 


00 


53- 


46 


AAAA 


ATOM 


1883 


O 


LEU A 243 


-15. 


116 


62. 


806 


30. 


4 97 


1. 


00 


54. 


25 


AAAA 


ATOM 


1884 


N 


MET A 244 


-13. 


166 


63. 


918 


30. 


609 


1. 


00 


52. 


58 


AAAA 


ATOM 


1885 


CA 


MET A 244 


-13. 


776 


65. 


236 


30. 


590 


1. 


00 


53. 


34 


AAAA 


ATOM 


1886 


CB 


MET A 244 


-13. 


895 


65. 


795 


32. 


018 


1. 


00 


54. 


69 


AAAA 


ATOM 


1887 


CG 


MET A 244 


-15. 


022 


65. 


185 


32. 


849 


1. 


00 


57. 


09 


AAAA 


ATOM 


1888 


SD 


MET A 244 


-15. 


267 


65. 


986 


34. 


452 


1. 


00 


61. 


40 


AAAA 


ATOM 


1889 


CE 


MET A 244 


-16. 


008 


64 . 


612 


35. 


415 


1 . 


00 


56. 


61 


AAAA 


ATOM 


1890 


C 


MET A 24 4 


-12. 


958 


66. 


195 


29. 


728 


1. 


00 


53. 


36 


AAAA 


ATOM 


1891 


0 


MET A 244 


-11. 


730 


66. 


098 


29. 


657 


1 . 


00 


54. 


33 


AAAA 


ATOM 


1892 


N 


LEU A 245 


-13. 


646 


67. 


110 


29. 


056 


1. 


00 


52. 


53 


AAAA 


ATOM 


1893 


CA 


LEU A 245 


-12. 


981 


68. 


100 


28. 


226 


1. 


00 


51. 


49 


AAAA 


ATOM 


1894 


CB 


LEU A 245 


-13. 


320 


67. 


893 


26. 


754 


1. 


00 


51. 


39 


AAz\A 


ATOM 


1895 


CG 


LEU A 245 


-12. 


686 


66. 


667 


26.094 


1. 


00 


53. 


64 


AAAA 


ATOM 


1896 


CD1 


LEU A 245 


-13. 


435 


66. 


349 


24. 


814 


1. 


00 


53. 


73 


AAAA 


ATOM 


1897 


CD2 


LEU A 245 


-11. 


197 


66. 


912 


25. 


830 


1 . 


00 


50. 


89 


AAAA 


ATOM 


1898 


C 


LEU A 245 


-13. 


436 


69. 


478 


28. 


662 


1. 


00 


51. 


38 


AAAA 


ATOM 


1899 


O 


LEU A 245 


-14. 


595 


69. 


679 


29. 


021 


1. 


00 


51. 


07 


AAAA 


ATOM 


1900 


N 


TYR A 246 


-12. 


507 


70. 


420 


28. 


651 


1. 


00 


50. 


18 


AAAA 


ATOM 


1901 


CA 


TYR A 246 


-12. 


810 


71. 


788 


29. 


020 


1. 


.00 


50. 


.73 


AAAA 


ATOM 


1902 


CB 


TYR A 24 6 


-11. 


519 


72. 


529 


29. 


346 


1. 


00 


47. 


94 


AAAA 


ATOM 


1903 


CG 


TYR A 24 6 


-11. 


699 


73. 


955 


29. 


,790 


1. 


,00 


45. 


,73 


AAAA 


ATOM 


1904 


CD1 


TYR A 246 


-12. 


265 


74 . 


251 


31. 


,032 


1. 


,00 


44. 


,42 


AAAA 


ATOM 


1905 


CE1 


TYR A 246 


-12. 


325 


75. 


562 


31. 


498 


1. 


.00 


43. 


,30 


AAAA 


ATOM 


1906 


CD2 


TYR A 24 6 


-11. 


.212 


75. 


009 


29. 


,013 


1 . 


.00 


45. 


.57 


AAAA 


ATOM 


1907 


CE2 


TYR A 246 


-11. 


,266 


76. 


. 326 


29. 


,466 


1. 


.00 


42, 


. 93 


AAAA 


ATOM 


1908 


CZ 


TYR A 246 


-11. 


. 816 


76. 


,592 


30. 


.711 


1, 


.00 


43. 


. 68 


AAAA 


ATOM 


1909 


OH 


TYR A 246 


-11. 


.804 


77. 


.877 


31, 


.193 


1 , 


.00 


44 . 


.04 


AAAA 


ATOM 


1910 


C 


TYR A 246 


-13, 


.481 


72. 


,443 


27, 


. 816 


1 


.00 


52 


.10 


AAAA 


ATOM 


1911 


O 


TYR A 246 


-13. 


.133 


72. 


.147 


26, 


. 666 


1. 


.00 


53. 


.56 


AAAA 


ATOM 


1912 


N 


ASN A 247 


-14. 


.442 


73. 


.325 


28. 


.080 


1 


.00 


52, 


.10 


AAAA 


ATOM 


1913 


CA 


ASN A 247 


-15. 


.151 


74 . 


.033 


27, 


.022 


1 


.00 


50. 


.83 


AAAA 


ATOM 


1914 


CB 


ASN A 247 


-16 


.650 


73, 


.943 


27. 


.262 


1. 


.00 


50 


.18 


AAAA 


ATOM 


1915 


CG 


ASN A 247 


-17, 


.450 


74, 


.353 


26.056 


1. 


.00 


49 


.02 - 


— AAAA 


ATOM 


1916 


OD1 


ASN A 247 


-17. 


.138 


75.347 


25 


.408 


1 


.00 


47 


.60 


AAAA 


ATOM 


1917 


ND2 


ASN A 247 


-18. 


.4 96 


73 


.591 


25.748 


1 


.00 


47 


.20 


AAAA 


ATOM 


1918 


c 


ASN A 247 


-14 


.714 


75 


.499 


27 


.038 


1 


. 00 


50 


.54 


AAAA 


ATOM 


1919 


O 


ASN A 247 


-15.081 


76 


.255 


27 


.926 


1 


.00 


51 


.49 


AAAA 


ATOM 


1920 


N 


PRO A 248 


-13 


.925 


75 


.920 


26 


. 050 


1 


.00 


50 


.16 


AAAA 


ATOM 


1921 


CD 


PRO A 24 8 


-13 


.412 


75 


.126 


24 


. 924 


1 


.00 


50 


.04 


AAAA 


ATOM 


1922 


CA 


PRO A 248 


-13 


.444 


77 


.303 


25 


. 974 


1 


.00 


50 


.89 


AAAA 


ATOM 


1923 


CB 


PRO A 248 


-12 


.619 


77 


.310 


24 


.691 


1 


.00 


50 


.70 


AAAA 


ATOM 


1924 


CG 


PRO A 24 8 


-12 


.162 


75 


.884 


24 


.565 


1 


.00 


51 


. 21 


AAAA 


ATOM 


1925 


C 


PRO A 248 


-14 


.523 


78 


.394 


25 


. 956 


1 


.00 


52 


. 50 


AAAA 


ATOM 


1926 


o 


PRO A 248 


-14 


.224 


79 


.566 


26 


.210 


1 


. 00 


52 


.35 


AAAA 


ATOM 


1927 


N 


THR A 249 


-15 


.765 


78 


.028 


25 


.646 


1 


. 00 


52 


. 66 


AAAA 


ATOM 


1928 


CA 


THR A 24 9 


-16 


.838 


79 


.017 


25 


.595 


1 


. 00 


54 


. 32 


AAAA 


ATOM 


1929 


CB 


THR A 24 9 


-17 


.575 


79 


.028 


24 


.214 


1 


.00 


54 


. 98 


AAAA 


ATOM 


1930 


OG1 


THR A 24 9 


-18 


.420 


77 


.871 


24 


.096 


1 


.00 


54 


.34 


AAAA 


ATOM 


1931 


CG2 


THR A 24 9 


-16 


.570 


79 


.058 


23 


.064 


1 


.00 


54 


.03 


AAAA 


ATOM 


1932 


C 


THR A 249 


-17 


.869 


78 


.746 


26 


.668 


1 


.00 


54 


. 95 


AAAA 


ATOM 


1933 


O 


THR A 24 9 


-19 


.025 


79 


.129 


26 


.544 


1 


.00 


57 


.09 


AAAA 


ATOM 


1934 


N 


THR A 250 


-17 


.450 


78 


.091 


27 


.734 


1 


.00 


55 


.43 


AAAA 


ATOM 


1935 


CA 


THR A 250 


-18 


.375 


77 


.769 


28 


.806 


1 


.00 


55 


.67 


AAAA 


ATOM 


1936 


CB 


THR A 250 


-18 


.921 


76 


.340 


28 


. 643 


1 


.00 


56 


.52 


AAAA 


ATOM 


1937 


OG1 


THR A 250 


-19 


.473 


76 


.190 


27 


.334 


1 


.00 


59 


.78 


AAAA 


ATOM 


1938 


CG2 


THR A 250 


-19 


.992 


76 


.054 


29 


.669 


1 


.00 


56 


.31 


AAAA 


ATOM 


1939 


C 


THR A 250 


-17 


.659 


77 


.837 


30 


.133 


1 


.00 


55 


.14 


AAAA 


ATOM 


1940 


O 


THR A 250 


-18 


.299 


77 


.905 


31 


.183 


1 


.00 


55 


.72 


AAAA 


ATOM 


1941 


N 


TYR A 251 


-16 


.327 


77 


.825 


30 


.069 


1 


.00 


53 


.45 


AAAA 


ATOM 


1942 


CA 


TYR A 251 


-15 


.495 


77 


.827 


31 


.260 


1 


.00 


50 


.54 


AAAA 


ATOM 


1943 


CB 


TYR A 251 


-15 


.534 


79 


.181 


31 


.947 


1 


.00 


48 


.07 


AAAA 


ATOM 


1944 


CG 


TYR A 251 


-15 


.030 


80 


.275 


31 


.060 


1 


.00 


46 


.80 


AAAA 


ATOM 


1945 


CD1 


TYR A 251 


-15 


.801 


80 


.735 


29 


. 986 


1 


.00 


47 


.87 


AAAA 
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ATOM 


1946 


CE1 


TYR A 251 


-15.316 


ATOM 


1947 


CD2 


TYR A 251 


-13.764 


ATOM 


1948 


CE2 


TYR A 251 


-13.271 


ATOM 


1949 


CZ 


TYR A 251 


-14.051 


ATOM 


1950 


OH 


TYR A 251 


-13.570 


ATOM 


1951 


C 


TYR A 251 


-16.102 


ATOM 


1952 


O 


TYR A 251 


-16.393 


ATOM 


1953 


N 


GLN A 252 


-16.314 


ATOM 


1954 


CA 


GLN A 252 


-16.899 


ATOM 


1955 


CB 


GLN A 252 


-18.413 


ATOM 


1956 


CG 


GLN A 252 


-19.134 


ATOM 


1957 


CD 


GLN A 252 


-18.798 


ATOM 


1958 


OE1 


GLN A 252 


-19.125 


ATOM 


1959 


NE2 


GLN A 252 


-18.123 


ATOM 


1960 


C 


GLN A 252 


-16.342 


ATOM 


1961 


O 


GLN A 252 


-15.834 


ATOM 


1962 


N 


MET A 253 


-16.426 


ATOM 


1963 


CA 


MET A 253 


-15.930 


ATOM 


1964 


CB 


MET A 253 


-15.300 


ATOM 


1965 


CG 


MET A 253 


-14.058 


ATOM 


1966 


SD 


MET A 253 


-12.751 


ATOM 


1967 


CE 


MET A 253 


-12.346 


ATOM 


1968 


C 


MET A 253 


-17.089 


ATOM 


1969 


O 


MET A 253 


-18.081 


ATOM 


1970 


N 


ASP A 254 


-16.963 


ATOM 


1971 


CA 


ASP A 254 


-17.981 


ATOM 


1972 


CB 


ASP A 254 


-18.135 


ATOM 


1973 


CG 


ASP A 254 


-18.710 


ATOM 


1974 


OD1 


ASP A 254 


-18.569 


ATOM 


1975 


OD2 


ASP A 254 


-19.310 


ATOM 


1976 


C 


ASP A 254 


-17.514 


ATOM 


1977 


O 


ASP A 254 


-16.316 


ATOM 


1978 


N 


VAL A 255 


-18.461 


ATOM 


1979 


CA 


VAL A 255 


-18.142 


ATOM 


1980 


CB 


VAL A 255 


-19.310 


ATOM 


1981 


CGI 


VAL A 255 


-19.010 


ATOM 


1982 


CG2 


VAL A 255 


-19.552 


ATOM 


1983 


C 


VAL A 255 


-17.830 


ATOM 


1984 


O 


VAL A 255 


-18.686 


ATOM 


1985 


N 


ASN A 256 


-16.607 


ATOM 


1986 


CA 


ASN A 256 


-16.234 


ATOM 


1987 


CB 


ASN A 256 


-14.720 


ATOM 


1988 


CG 


ASN A 256 


-14.308 


ATOM 


1989 


OD1 


ASN A 256 


-15.119 


ATOM 


1990 


ND2 


ASN A 256 


-13.046 


ATOM 


1991 


C 


ASN A 256 


-16.821 


ATOM 


1992 


O 


ASN A 256 


-16.453 


ATOM 


1993 


N 


PRO A 257 


-17.738 


ATOM 


1994 


CD 


PRO A 257 


-18.205 


ATOM 


1995 


CA 


PRO A 257 


-18.337 


ATOM 


1996 


CB 


PRO A 257 


-19.445 


ATOM 


1997 


CG 


PRO A 257 


-18.836 


ATOM 


1998 


C 


PRO A 257 


-17.317 


ATOM 


1999 


O 


PRO A 257 


-17.508 


ATOM 


2000 


N 


GLU A 258 


-16.233 


ATOM 


2001 


CA 


GLU A 258 


-15.181 


ATOM 


2002 


CB 


GLU A 258 


-14.702 


ATOM 


2003 


CG 


GLU A 258 


-15.646 


ATOM 


2004 


CD 


GLU A 258 


-15.040 


ATOM 


2005 


OE1 


GLU A 258 


-14 . 860 


ATOM 


2006 


OE2 


GLU A 258 


-14.739 


ATOM 


2007 


C 


GLU A 258 


-13.982 


ATOM 


2008 


0 


GLU A 258 


-12.866 


ATOM 


2009 


N 


GLY A 259 


-14.215 


ATOM 


2010 


CA 


GLY A 259 


-13.131 


ATOM 


2011 


C 


GLY A 259 


-12.830 


ATOM 


2012 


O 


GLY A 259 


-13.679 


ATOM 


2013 


N 


LYS A 260 


-11.607 


ATOM 


2014 


CA 


LYS A 260 


-11.161 


ATOM 


2015 


CB' 


LYS A 260 


-10.509 


ATOM 


2016 


CG 


LYS A 260 


-11.434 


ATOM 


2017 


CD 


LYS A 260 


-10.804 


ATOM 


2018 


CE 


LYS A 260 


-11.570 


ATOM • 


2019 


NZ 


LYS A 260 


-13.018 


ATOM 


2020 


C 


LYS A 260 


-10.173 



81. 


735 


29. 


125 


1. 


00 


46. 


95 


AAAA 


80.833 


31. 


262 


1. 


00 


45.01 


AAAA 


81. 


826 


30.418 


1. 


00 


46. 


07 


AAAA 


82. 


274 


29.349 


1. 


00 


47. 


04 


AAAA 


83. 


259 


28. 


513 


1. 


00 


46. 


92 


AAAA 


76.757 


32. 


137 


1. 


00 


50. 


21 


AAAA 


76.977 


33. 


316 


1. 


00 


48. 


01 


AAAA 


75.601 


31. 


516 


1. 


00 


50. 


16 


AAAA 


74. 


457 


32.186 


1.00 


52. 


20 


AAAA 


74. 


485 


32. 


018 


1. 


00 


53- 


68 


AAAA 


73. 


546 


32. 


953 


1. 


00 


57. 


93 


AAAA 


73. 


835 


34. 


398 


1. 


00 


60. 


07 


AAAA 


74. 


906 


34. 


922 


1. 


00 


59.29 


AAAA 


72. 


885 


35. 


050 


1. 


00 


62. 


01 


AAAA 


73. 


155 


31. 


622 


1. 


00 


52.36 


AAAA 


73. 


120 


30. 


506 


1. 


00 


54. 


07 


AAAA 


72. 


085 


32. 


402 


1. 


00 


52. 


31 


AAAA 


70. 


798 


31. 


948 


1. 


00 


51. 


56 


AAAA 


70. 


000 


33. 


100 


1. 


00 


47. 


96 


AAAA 


70. 


603 


33. 


739 


1. 


00 


42. 


98 


AAAA 


70. 


938 


32. 


564 


1. 


00 


41. 


36 


AAAA 


69. 


315 


32. 


007 


1. 


00 


38. 


38 


AAAA 


70. 


002 


31. 


385 


1. 


00 


53. 


96 


AAAA 


69. 


772 


32. 


078 


1. 


00 


54. 


78 


AAAA 


69. 


593 


30. 


125 


1. 


00 


56. 


61 


AAAA 


68. 


7 87 


29. 


459 


1. 


00 


58. 


55 


AAAA 


69. 


205 


27. 


993 


1. 


00 


58. 


42 


AAAA 


70. 


597 


27. 


841 


1. 


00 


59. 


84 


AAAA 


71. 


176 


26. 


747 


1. 


00 


59. 


99 


AAAA 


71. 


112 


28. 


808 


1. 


00 


61. 


08 


AAAA 


67. 


339 


29. 


526 


1. 


00 


60. 


18 


AAAA 


67. 


066 


29. 


.644 


1. 


00 


59. 


54 


AAAA 


66. 


414 


29. 


.451 


1. 


,00 


61. 


.88 


AAAA 


64. 


997 


29. 


.506 


1. 


.00 


63. 


.18 


AAAA 


64. 


195 


30. 


.088 


1. 


.00 


62. 


,28 


AAAA 


62. 


710 


30. 


.002 


1. 


.00 


62. 


,53 


AAAA 


64. 


,608 


31. 


.522 


1. 


.00 


61. 


.52 


AAAA 


64. 


.442 


28. 


.127 


1. 


.00 


64. 


.24 


AAAA 


64. 


.452 


27, 


.252 


1. 


.00 


65, 


.57 


AAAA 


63. 


.960 


27. 


.931 


1. 


.00 


64. 


.99 


AAAA 


63. 


.385 


26, 


.649 


1. 


.00 


65. 


.59 


AAAA 


63. 


.292 


26 


.505 


1, 


.00 


64. 


.80 


AAAA 


62. 


.740 


25 


.154 


1 


.00 


65. 


.41 


AAAA 


62. 


.135 


24 


.450 


1 


.00 


66. 


.45 


AAAA 


62. 


.933 


24 


.787 


1 


.00 


64, 


.10 ■ 


AAAA 


61 


. 984 


26 


.571 


1 


.00 


66 


.88 


AAAA 


61 


.108 


27 


.351 


1 


.00 


67 


.44 


AAAA 


61 


.747 


25 


.622 


1 


.00 


68 


.28 


AAAA 


62 


.633 


24 


.539 


1 


.00 


67 


.80 


AAAA 


60 


.414 


25 


.504 


1 


.00 


68 


.62 


AAAA 


60 


.632 


24 


.480 


1 


.00 


68 


.57 


AAAA 


61 


.656 


23 


.568 


1 


.00 


68 


.81 


AAAA 


59 


.359 


25 


.063 


1 


.00 


68 


.79 


AAAA 


58 


.164 


25 


. 299 


1 


.00 


68 


.77 


AAAA 


59 


.813 


24 


.435 


1 


.00 


68 


.17 


AAAA 


58 


.918 


23 


. 959 


1 


.00 


68 


.45 


AAAA 


59 


.352 


22 


.573 


1 


.00 


70 


.09 


AAAA 


58 


.958 


21 


.458 


1 


.00 


73 


. 60 


AAAA 


59 


.164 


20 


.092 


1 


.00 


75 


.72 


AAAA 


60 


.333 


19 


.694 


1 


.00 


76 


.89 


AAAA 


58 


.154 


19 


.419 


1 


.00 


77 


.34 


AAAA 


58 


.830 


24 


.894 


1 


.00 


67 


.48 


AAAA 


58 


.531 


24 


.461 


1 


.00 


66 


.70 


AAAA 


59 


.085 


26 


.176 


1 


.00 


66 


.17 


AAAA 


59 


.031 


27 


.137 


1 


.00 


63 


.66 


AAAA 


57 


.628 


27 


.616 


1 


.00 


62 


.15 


AAAA 


56 


.733 


27 


.545 


1 


.00 


60 


.63 


AAAA 


57 


.442 


28 


.101 


1 


.00 


61 


.06 


AAAA 


56 


.155 


28 


.619 


1 


.00 


60 


.17 


AAAA 


55 


.336 


27 


.503 


1 


.00 


60 


.58 


AAAA 


55 


.038 


26 


.339 


1 


.00 


61 


.51 


AAAA 


54 


.049 


25 


.375 


1 


.00 


63 


.16 


AAAA 


53 


.993 


24 


.064 


1 


.00 


65 


.08 


AAAA 


53 


.689 


24 


.266 


1 


.00 


66 


.94 


AAAA 


56 


.349 


29 


.776 


1 


.00 


58 


.44 


AAAA 
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ATOM 


2021 


O 


LYS A 260 


-9.785 


57. 


479 


30.088 


1.00 57.91 


AAAA 


ATOM 


2022 


N 


TYR A 261 


-9.772 


55. 


248 


30.409 


1.00 55.67 


AAAA 


ATOM 


2023 


CA 


TYR A 261 


-8.831 


55.309 


31.522 


1.00 52.07 


AAAA 


ATOM 


2024 


CB 


TYR A 261 


-9.318 


54. 


464 


32.690 


1.00 49.48 


AAAA 


ATOM 


2025 


CG 


TYR A 261 


-10.607 


54. 


936 


33.297 


1.00 47.00 


AAAA 


ATOM 


2026 


GDI 


TYR A 261 


-11.832 


54. 


632 


32.715 


1.00 44.34 


AAAA 


ATOM 


2027 


CE1 


TYR A 261 


-13.019 


55. 


084 


33.273 


1.00 43.38 


AAAA 


ATOM 


2028 


CD2 


TYR A 261 


-10.600 


55. 


705 


34.453 


1.00 47.13 


AAAA 


ATOM 


2029 


CE2 


TYR A 261 


-11.785 


56. 


166 


35.017 


1.00 46.03 


AAAA 


ATOM 


2030 


CZ 


TYR A 261 


-12.986 


55. 


852 


34.422 


1.00 44.23 


AAAA 


ATOM 


2031 


OH 


TYR A 261 


-14.142 


56. 


325 


34.991 


1.00 45.72 


AAAA 


ATOM 


2032 


C 


TYR A 261 


-7.447 


54. 


828 


31.148 


1.00 51.74 


AAAA 


ATOM 


2033 


O 


TYR A 261 


-7.286 


53. 


985 


30.267 


1.00 52.11 


AAAA 


ATOM 


2034 


N 


SER A 262 


-6.448 


55. 


373 


31.830 


1.00 50.75 


AAAA 


ATOM 


2035 


CA 


SER A 262 


-5.067 


54. 


980 


31.602 


1.00 50.14 


AAAA 


ATOM 


2036 


CB 


SER A 262 


-4.106 


56. 


119 


31.965 


1.00 49.74 


AAAA 


ATOM 


2037 


OG 


SER A 262 


-4.173 


57. 


192 


31.046 


1.00 48.23 


AAAA 


ATOM 


2038 


C 


SER A 262 


-4.754 


53. 


770 


32.477 


1.00 49.93 


AAAA 


ATOM 


2039 


0 


SER A 262 


-4. 935 


53.802 


33.691 


1.00 49.99 


AAAA 


ATOM 


2040 


N 


PHE A 263 


-4.293 


52. 


700 


31.849 


1.00 50.71 


AAAA 


ATOM 


2041 


CA 


PHE A 263 


-3.927 


51. 


481 


32.559 


1.00 51.05 


AAAA 


ATOM 


2042 


CB 


PHE A 263 


-5.008 


50. 


419 


32.387 


1.00 54.72 


AAAA 


ATOM 


2043 


CG 


PHE A 263 


-4.768 


49. 


167 


33.187 


1.00 59.61 


AAAA 


ATOM 


2044 


CD1 


PHE A 263 


-4.431 


47. 


970 


32.549 


1.00 60.53 


AAAA 


ATOM 


2045 


CD2 


PHE A 263 


-4.906 


49. 


173 


34.578 


1.00 60.22 


AAAA 


ATOM 


2046 


CE1 


PHE A 263 


-4.239 


46. 


802 


33.285 


1.00 60.70 


AAAA 


ATOM 


2047 


CE2 


PHE A 263 


-4.715 


48. 


006 


35.325 


1.00 60.28 


AAAA 


ATOM 


2048 


CZ 


PHE A 263 


-4.384 


46. 


822 


34.67 9 


1.00 60.95 


AAAA 


ATOM 


2049 


C 


PHE A 263 


-2.638 


51 . 


028 


31.904 


1.00 50.56 


AAAA 


ATOM 


2050 


O 


PHE A 263 


-2.646 


50. 


424 


30.827 


1.00 49.12 


AAAA 


ATOM 


2051 


N 


GLY A 264 


-1.523 


51. 


341 


32.547 


1.00 50.31 


AAAA 


ATOM 


2052 


CA 


GLY A 264 


-0.251 


50. 


973 


31.971 


1.00 49.70 


AAAA 


ATOM 


2053 


C 


GLY A 264 


-0.161 


51 . 


724 


30.665 


1.00 49.89 


AAAA 


ATOM 


2054 


O 


GLY A 264 


-0.469 


52. 


910 


30.618 


1.00 52.14 


AAAA 


ATOM 


2055 


N 


ALA A 265 


0.233 


51. 


038 


29. 600 


1.00 49.30 


AAAA 


ATOM 


2056 


CA 


ALA A 265 


0.372 


51. 


670 


28.297 


1.00 47.40 


AAAA 


ATOM 


2057 


CB 


ALA A 265 


1.608 


51. 


139 


27.602 


1.00 46.31 


AAAA 


ATOM 


2058 


C 


ALA A 265 


-0.851 


51, 


438 


27.427 


1.00 47.46 


AAAA 


ATOM 


2059 


0 


ALA A 265 


-0.746 


51. 


,420 


26.203 


1.00 46.68 


AAAA 


ATOM 


2060 


N 


THR A 266 


-2.011 


51, 


,274 


28.055 


1.00 48.17 


AAAA 


ATOM 


2061 


CA 


THR A 266 


-3.241 


51. 


.022 


27.311 


1.00 49.65 


AAAA 


ATOM 


2062 


CB 


THR A 266 


-3.622 


49. 


.508 


27.353 


1.00 48.58 


AAAA 


ATOM 


2063 


OG1 


THR A 266 


-4.029 


49. 


.154 


28.676 


1.00 48.91 


AAAA 


ATOM 


2064 


CG2 


THR A 266 


-2.437 


48. 


.631 


26.961 


1.00 46.18 


AAAA 


ATOM 


2065 


C 


THR A 266 


-4.422 


51. 


.821 


27.860 


1. 00- 50.98 


AAAA 


ATOM 


2066 


O 


THR A 266 


-4.452 


52 


.157 


29.042 


1.00 51.40 


AAAA 


ATOM 


2067 


N 


CYS A 267 


-5.391 


52 


.123 


26.996 


1.00 52.99 


AAAA 


ATOM 


2068 


CA 


CYS A 267 


-6.590 


52 


.860 


27.406 


1.00 55.11 


AAAA 


ATOM 


2069 


C 


CYS A 267 


-7.748 


51 


.875 


27.434 


1.00 56.75 


AAAA 


ATOM 


2070 


O 


CYS A 267 


-7.919 


51 


.090 


26.500 


1.00 57.16 


AAAA 


ATOM 


2071 


CB 


CYS A 267 


-6.899 


53 


. 982 


26.424 


1.00 54.20 


AAAA 


ATOM 


2072 


SG 


CYS A 267 


-5.445 


54 


.981 


25.989 


1.00 57.74 


AAAA 


ATOM 


2073 


N 


VAL A 268 


-8.551 


51 


.926 


28.493 


1.00 58.27 


AAAA 


ATOM 


2074 


CA 


VAL A 268 


-9.665 


50 


.994 


28.639 


1.00 59.25 


AAAA 


ATOM 


2075 


CB 


VAL A 268 


-9.274 


49 


.880 


29.630 


1.00 58.62 


AAAA 


ATOM 


2076 


CGI 


VAL A 268 


-8.097 


49 


.103 


29.086 


1.00 56.64 


AAAA 


ATOM 


2077 


CG2 


: VAL A 268 


-8.897 


50 


.4 91 


30.975 


1.00 57.47 


AAAA 


ATOM 


2078 


C 


VAL A 268 


-10.958 


51 


.652 


29.114 


1.00 60.54 


AAAA 


ATOM 


2079 


O 


VAL A 268 


-10. 937 


52 


.767 


29.629 


1.00 61.52 


AAAA 


ATOM 


2080 


M 


LYS A 269 


-12.078 


50 


.954 


28.951 


1.00 62.34 


AAAA 


ATOM 


2081 


CA 


LYS A 269 


-13.380 


51 


.481 


29.371 


1.00 64.85 


AAAA 


ATOM 


2082 


CB 


LYS A 269 


-14.519 


50 


.526 


28.980 


1.00 65.55 


AAAA 


ATOM 


2083 


CG 


LYS A 269 


-14.599 


50 


.181 


27.4 97 


1.00 65.87 


AAAA 


ATOM 


2084 


CD 


LYS A 269 


-13.430 


49 


.300 


27.080 


1.00 67.45 


AAAA 


ATOM 


2085 


CE 


LYS A 269 


-13.37 6 


48 


.026 


27.921 


1.00 67.70 


AAAA 


ATOM 


2086 


NZ 


LYS A 269 


-12.118 


47 


.260 


27.712 


1.00 68.36 


AAAA 


ATOM 


2087 


C 


LYS A 269 


-13.435 


51 


.728 


30.876 


1.00 65.72 


AAAA 


ATOM 


2088 


O 


LYS A 269 


-13.788 


52 


.815 


31.314 


1.00 66.07 


AAAA 


ATOM 


2089 


N 


LYS A 27 0 


-13.102 


50 


.719 


31.669 


1.00 67.7 9 


AAAA 


ATOM 


2090 


CA 


LYS A 270 


-13.110 


50 


.87 9 


33.122 


1.00 71.12 


AAAA 


ATOM 


2091 


CB 


LYS A 270 


-14.368 


50 


.248 


33.736 


1.00 72.73 


AAAA 


ATOM 


2092 


CG 


LYS A 27 0 


-15.687 


50 


.815 


33.190 


1.00 76.20 


AAAA 


ATOM 


2093 


CD 


LYS A 270 


-15.802 


52 


.330 


33.369 


1.00 76.38 


AAAA 


ATOM 


2094 


CE 


LYS A 270 


-16.264 


52 


.722 


34.767 


1.00 78.14 


AAAA 


ATOM 


2095 


NZ 


LYS A 27 0 


-17.710 


52 


.436 


34.986 


1.00 78.22 


AAAA 
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ATOM 


2096 


C 


LYS A 


270 


-11. 


863 


50. 


257 


33.745 


1. 


00 


71. 


54 


AAAA 


ATOM 


2097 


b 


LYS A 


270 


-11. 


283 


49. 


312 


33.197 


1. 


00 


71. 


77 


AAAA 


ATOM 


2098 


N 


CYS A 


271 


-11- 


436 


50. 


7 96 


34.883 


1. 


00 


70. 


78 


AAAA 


ATOM 


2099 


CA 


CYS A 


271 


-10. 


257 


50. 


255 


35.536 


1. 


00 


70. 


35 


AAAA 


ATOM 


2100 


C 


CYS A 


271 


-10. 


536 


48. 


825 


35.943 


1. 


00 


71. 


95 


AAAA 


ATOM 


2101 


O 


CYS A 


271 


-11. 


689 


48. 


456 


36.176 


1. 


00 


72. 


17 


AAAA 


ATOM 


2102 


CB 


CYS A 


271 


-9. 


887 


51. 


075 


36.767 


1. 


00 


67. 


01 


AAAA 


ATOM 


2103 


SG 


CYS A 


271 


-9. 


176 


52. 


695 


36.367 


1. 


00 


62. 


70 


AAAA 


ATOM 


2104 


N 


PRO A 


272 


-9. 


485 


47. 


993 


36.015 


1. 


00 


73. 


29 


AAAA 


ATOM 


2105 


CD 


PRO A 


272 


-8. 


143 


48. 


228 


35.461 


1. 


00 


73. 


55 


AAAA 


ATOM 


2106 


CA 


PRO A 


272 


-9. 


641 


46. 


589 


36.401 


1. 


00 


74. 


19 ' 


AAAA 


ATOM 


2107 


CB 


PRO A 


272 


-8. 


301 


45. 


965 


36.008 


1. 


00 


74.11 


AAAA 


ATOM 


2108 


CG 


PRO A 


272 


-7. 


800 


46. 


865 


34.918 


1. 


00 


75. 


07 


AAAA 


ATOM 


2109 


C 


PRO A 


272 


-9. 


914 


46. 


456 


37.893 


1. 


00 


75. 


58 


AAAA 


ATOM 


2110 


O 


PRO A 


272 


-10. 


002 


47. 


454 


38.618 


1. 


00 


74. 


67 


AAAA 


ATOM 


2111 


N 


ARG A 


273 


-10. 


029 


45. 


211 


38.343 


1. 


00 


77. 


35 


AAAA 


ATOM 


2112 


CA 


ARG A 


273 


-10. 


301 


44. 


918 


39.742 


1. 


00 


77. 


99 


AAAA 


ATOM 


2113 


CB 


ARG A 


273 


-10. 


834 


43. 


488 


39.875 


1. 


00 


80. 


91 


AAAA 


ATOM 


2114 


CG 


ARG A 


273 


-11. 


848 


43. 


098 


38.805 


1. 


00 


85. 


02 


AAAA 


ATOM 


2115 


CD 


ARG A 


273 


-13. 


084 


43. 


993 


38.836 


1. 


00 


88. 


44 


AAAA 


ATOM 


2116 


NE 


ARG A 


273 


-13. 


889 


43. 


848 


37.624 


1. 


00 


89. 


23 


AAAA 


ATOM 


2117 


CZ 


ARG A 


273 


-14 . 


981 


44. 


558 


37.350 


1. 


00 


89. 


65 


AAAA 


ATOM 


2118 


NH1 


ARG A 


273 


-15. 


415 


45. 


477 


38.205 


1. 


00 


89. 


11 


AAAA 


ATOM 


2119 


NH2 


ARG A 


273 


-15. 


638 


44. 


349 


36.215 


1. 


00 


88. 


99 


AAAA 


ATOM 


2120 


C 


ARG A 


273 


-9. 


061 


45. 


086 


40.619 


1. 


00 


76. 


88 


AAAA 


ATOM 


2121 


O 


ARG A 


273 


-7 . 


973 


44. 


599 


40.289 


1. 


00 


76. 


21 


AAAA 


ATOM 


2122 


N 


ASN A 


274 


-9. 


245 


45. 


799 


41.727 


1. 


00 


74. 


92 


AAAA 


ATOM 


2123 


CA 


ASN A 


274 


-8. 


200 


46. 


043 


42.718 


1. 


00 


73. 


59 


AAAA 


ATOM 


2124 


CB 


ASN A 


274 


-7 . 


4 01 


44. 


754 


42. 968 


1. 


00 


72. 


.11 


AAAA 


ATOM 


2125 


CG 


ASN A 


274 


-6. 


805 


44. 


699 


44 .366 


1. 


00 


70. 


44 


AAAA 


ATOM 


2126 


OD1 


ASN A 


274 


-7 . 


502 


44. 


904 


45.359 


1. 


.00 


66. 


95 


AAAA 


ATOM 


2127 


ND2 


ASN A 


274 


-5. 


512 


44. 


411 


44.447 


1. 


.00 


71. 


.38 


AAAA 


ATOM 


2128 


C 


ASN A 


274 


-7 . 


.251 


47. 


214 


42.433 


1. 


.00 


72. 


.90 


AAAA 


ATOM 


2129 


O 


ASN A 


274 


-6. 


.244 


47. 


392 


43.125 


1. 


.00 


72, 


.58 


AAAA 


ATOM 


2130 


N 


TYR A 


275 


-7 . 


561 


48. 


004 


41.413 


1. 


.00 


71, 


,12 


AAAA 


ATOM 


2131 


CA 


TYR A 


275 


-6. 


,751 


49. 


174 


41.108 


1. 


.00 


68. 


.67 


AAAA 


ATOM 


2132 


CB 


TYR A 


275 


-6. 


.455 


49. 


.276 


39.616 


1. 


.00 


67. 


.88 


AAAA 


ATOM 


2133 


CG 


TYR A 


275 


-5. 


250 


48. 


,481 


39.190 


1, 


. 00 


67, 


.42 


AAAA 


ATOM 


2134 


CD1 


TYR A 


275 


-5. 


.275 


47. 


.089 


39.191 


1, 


.00 


68. 


.27 


AAAA 


ATOM 


2135 


CE1 


TYR A 


275 


-4. 


.152 


46. 


.348 


38.830 


1, 


.00 


68. 


.51 


AAAA 


ATOM 


2136 


CD2 


TYR A 


275 


-4. 


.071 


49. 


.120 


38.815 


1, 


.00 


66. 


.02 


AAAA 


ATOM 


2137 


CE2 


TYR A 


275 


-2 . 


. 945 


48. 


.393 


38.453 


1. 


.00 


67 


.51 


AAAA 


ATOM 


2138 


CZ 


TYR A 


275 


-2. 


.991 


47, 


.005 


38.465 


1, 


.00 


68 


.39 


AAAA 


ATOM 


2139 


OH 


TYR A 


275 


-1, 


.874 


46. 


.276 


38.127 


1 


.00 


67. 


.65 


AAAA 


ATOM 


2140 


C 


TYR A 


275 


-7. 


.575 


50. 


.361 


41.551" 


1 


.00 


67 


.62 


AAAA 


ATOM 


2141 


O 


TYR A 


275 


-8, 


.785 


50. 


.251 


41.710 


1 


.00 


67 


.44 


AAAA 


ATOM 


2142 


N 


VAL A 


276 


-6 


.934 


51 


.496 


41.765 


1 


. 00 


67 


.45 


AAAA 


ATOM 


2143 


CA 


VAL A 


276 


-7 


.676 


52 


.666 


42.192 


1 


.00 


68 


.16 


AAAA 


ATOM 


2144 


CB 


VAL A 


276 


-6 


.932 


53 


.423 


43.296 


1 


.00 


68 


.04 


AAAA 


ATOM 


2145 


CGI 


VAL A 


276 


-7 


.734 


54 


.635 


43.721 


1 


.00 


67 


.32 


AAAA 


ATOM 


2146 


CG2 


VAL A 


276 


-6 


.692 


52 


.501 


44.472 


1 


.00 


66 


.99 


AAAA 


ATOM 


2147 


C 


VAL A 


276 


-7 


.927 


53 


.602 


41.023 


1 


.00 


68 


.99 


AAAA 


ATOM 


2148 


o 


VAL A 


276 


-7 


.072 


53 


.766 


40.151 


1 


.00 


68 


.87 


AAAA 


ATOM 


2149 


N 


VAL A 


277 


-9 


.111 


54 


.207 


41.011 


1 


.00 


70 


.12 


AAAA 


ATOM 


2150 


CA 


VAL A 


277 


-9 


.498 


55 


.134 


39.953 


1 


.00 


70 


.77 


AAAA 


ATOM 


2151 


CB 


VAL A 


277 


-10 


. 972 


54 


.904 


39.539 


1 


.00 
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